GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: 



February 15, 2005, 06:25:17 ; Search time 6778 Seconds 

(without alignments) 
13028.785 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-10-017-867A-281 
2320 

1 agggtcccttagccgggcgc tctctccccaacctcactaa 2320 



Scoring table: 



IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 



Searched: 34239544 seqs, 19032134700 residues 

Total number of hits satisfying chosen parameters: 68479088 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 

Listing first 1500 summaries 



Database : EST:* 



1 


gb_ 


estl:* 


2 


gb_ 


]est2:* 


3 


gb_ 


]htc:* 


4 


gb_ 


_est3 : * 


5 


gb_ 


_est4 : * 


6 


gb_ 


_est5:* 


7 


gb_ 


_est6 : * 


8 


gb_ 


gssl : * 


9 


' gb_ 


gss2 : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% ' 
Result Query 

No. Score Match Length DB ID Description 



1 


1180.4 


50. 


9 


1182 


9 


AY404343 


AY404343 


Homo sapi 


2 


912.2 


39. 


3 


2197 


3 


AK050128 


AK050128 


Mus muscu 


3 


857.4 


37. 


0 


1180 


9 


AY404344 


AY404344 


Pan trogl 


4 


853.2 


36. 


8 


879 


5 


BQ216829 


BQ216829 


AGENCOURT 


5 


754 


32. 


5 


759 


7 


CN396938 


CN396938 


170004177 


6 


738 


31. 


8 


916 


6 


CD050395 


CD050395 


AGENCOURT 


7 


733.8 


31. 


6 


1185 


9 


AY404345 


AY404345 


Mus muscu 


8 


731.6 


31. 


5 


1353 


3 


AK041045 


AK041045 


Mus muscu 





9 


522.4 


22. 


5 


978 


4 


BI102968 




10 


506.4 


21. 


8 


510 


7 


CN396939 




11 


490.6 


21. 


1 


717 


7 


CR769328 




12 


488.6 


21. 


1 


761 


4 


BI559553 


c 


13 


463.6 


20. 


0 


694 


2 


AW173071 


c 


14 


457.6 


19. 


7 


659 


6 


CB435346 




15 


452.6 


19. 


5 


1723 


3 


AK052644 




16 


452.2 


19. 


5 


771 


7 


CR767782 




17 


450 


19. 


4 


724 


6 


CB952493 




18 


444.4 


. 19. 


2 


481 


6 


CB158774 


c 


19 


442 


19. 


1 


583 


1 


AI694348 


c 


20 


437.8 


18. 


9 


462 


5 


BX109306 


c 


21 


435.8 


18. 


8 


439 


1 


AA633698 




22 


431.6 


18. 


6 


899 


4 


BI246549 


c 


23 


430 


18. 


5 


430 


1 


AI917116 


c 


24 


422 


18. 


2 


422 


1 


AI580389 


c 


25 


417 


18. 


0 


1055 


1 


AI654867 




26 


415.8 


17. 


9 


660 


2 


BB221592 


c 


27 


410./2 


17. 


7 


478 


1 


AI015041 




28 


407.8 


17. 


6 


585 


7 


CR770709 




29 


406.4 


17. 


5 


901 


4 


BI827790 




30 


404.8 


17. 


4 


1021 


7 


CK231266 


c 


31 


400 


17. 


2 


543 


2 


AW236852 




32 


. 398.4 


17. 


2 


552 


7 


CV029784 




33 


396.4 


17. 


1 


721 


4 


BG972539 


c 


34 


393.4 


17. 


0 


470 


1 


AA974327 




35 


389 


16. 


8 


531 


8 


AQ386013 


c 


36 


388.6 


16. 


8 


449 


1 


AI217703 




37 


387.8 


16. 


7 


738 


6 


CB954127 




38 


386.6 


16. 


7 


751 


8 


AQ748070 




39 


384.8 


16. 


6 


886 


4 


BI101074 


c 


40 


384.6 


16. 


6 


454 


1 


AA970255 




41 


384.4 


16. 


6 


751 


6 


CB952810 




42 


384.2 


16. 


6 


582 


5 


BP274546 




43 


380.8 


16. 


4 


578 


5 


BP275078 


c 


44 


380.4 


16. 


4 


580 


6 


CB423692 


c 


45 


376.8 


16. 


2 


566 


6 


CB433850 




46 


. 375 


16. 


2 


460 


7 


R19994 




47 


374.2 


16. 


1 


759 


4 


BG971626 




48 


372.4 


16. 


1 


565 


4 


BI538734 




49 


370.4 


16. 


0 


910 


4 


BI099853 




50 


368.4 


15. 


9 


763 


6 


CD241803 




51 


365.4 


15. 


8 


419 


7 


CR740722 


c 


52 


364.6 


15. 


7 


619 


9 


CE584093 




53 


361.6 


15. 


6 


1193 


4 


BI102573 


c 


54 


357 


15. 


4 


363 


7 


Z39483 




55 


354.4 


15. 


3 


360 


5 


BP431975 


c 


56 


346.8 


14. 


9 


357 


1 


AA747964 




57 


341 


14. 


7 


880 


4 


BI143297 




58 


339.2 


14. 


6 


1082 


3 


BC024453 


c 


59 


336.2 


14. 


5 


363 


2 


BF510946 




60 


335 


14. 


4 


908 


2 


BF788011 




61 


334.4 


14. 


4 


342 


5 


BP431877 




62 


331.8 


14. 


3 


349 


7 


F06529 




63 


330.2 


14. 


2 


538 


4 


BG971731 




64 


329.6 


14. 


2 


776 


4 


BI220321 




65 


329 


14. 


2 


956 


2 


BF784727 



BI102968 602888578 
CN396939 170005999 
CR769328 DKFZp4690 
BI559553 603252894 
AW173071 xj82gll.x 
CB435346 615435 MA 
7VK052644 Mus muscu 
CR767782 DKFZp469C 
CB952493 AGENCOURT 
CB158774 K-EST0218 
AI694348 wd45g04.x 
BX109306 BX109306 
AA633698 ag87all.s 
BI246549 602958449 
AI917116 ts52a02.x 
AI580389 tm42f08.x 
AI654867 wb65cl2.x 
BB221592 BB221592 
AI015041 ov51a05.s 
CR770709 DKFZp469J 
BI827790 603075472 
CK231266 ILLUMIGEN 
AW236852 xm49hll.x 
CV029784 8702 Full 
BG972539 602841125 
AA974327 oql0g06.s 
AQ386013 RPCIll-14 
AI217703 qhl5e09.x 
CB954127 AGENCOURT 
AQ748070 HS_5538_A 
BI101074 602886333 
AA970255 op64h06.s 
CB952810 AGENCOURT 
BP274546 BP274546 
BP275078 BP275078 
CB423692 597208 MA 
CB433850 610302 MA 
R19994 yg38h02.rl 
BG971626 602840706 
BI538734 434264 MA 
BI099853 602884666 
CD241803 AGENCOURT 
CR740722 CR740722 
CE584093 tigr-gss- 
BI102573 602889579 
Z39483 HSC1BC022 n 
BP431975 BP431975 
AA747964 nx79a08.s 
BI143297 602907843 
BC024453 Mus muscu 
BF510946 UI-H-BI4- 
BF788011 602113215 
BP431877 BP431877 
F06529 HSC1BC021 n 
BG971731 602838268 
BI220321 602935611 
BF784727 6021li365 





66 


323 


13. 


9 


785 


2 


BF780827 


BF780827 


602105713 


c 


67 


317 


13. 


7 


706 


9 


CE140389 


CE140389 


tigr-gss- 




68 


314.8 


13. 


6 


772 


4 


BI099719 


BI099719 


602884581 




69 


314.6 


13. 


6 


769 


2 


AW044731 


AW044731 


uml3h02.y 


c. 


70 


314.4 


13. 


6 


436 


8 


AQ476159 


AQ476159 


CITBI-El- 




71 


308.4 


13. 


3 


875 


2 


BF788790 


BF788790 


602107191 




72 


305,6 


13. 


2 


786 


6 


CB953250 


CB953250 


AGENCOURT 




73 


304.2 


13. 


1 


890 


4 


BI101705 


BI101705 


602887533 


c 


74 


303 


13. 


1 


799 


7 


CK848020 


CK848020 


970722 MA 




75 


301.2 


13. 


0 


777 


4 


BG972946 


BG972946 


602840675 




76 


298.6 


12. 


9 


711 


2 


BF785562 


BF785562 


602112344 




77 


298.6 


12. 


9 


965 


2 


BF789220 


BF789220 


602105115 




78 


298.4 


12. 


9 


755 


4 


BI220886 


BI220886 


602937437 




79 


296.4 


12. 


8 


878 


2 


BF782452 


BF782452 


602106160 


c 


80 


296 


12. 


8 


455 


2 


AW380602 


AW380602 


RC2-HT027 




81 


294.6 


12. 


7 


453 


1 


AA245657 


AA245657 


mx03a05 . r 




82 


293.4 


12. 


6 


459 


5 


BX951484 


BX951484 


DKFZp781F 




83 


293 


12. 


6 


913 


4 


BG972966 


BG972966 


602840774 




84 


292.2 


12. 


6 


1452 


2 


BF782437 


BF782437 


602106145 


c 


85 


290.2 


12. 


5 


429 


2 


AW236558 


AW236558 


xm47g07 . x 




86 


288.6 


12. 


4 


776 


2 


BF781397 


BF781397 


602104580 




87 


288.2 


12. 


4 


713 


4 


BI327590 


BI327590 


602979770 




88 


286.4 


12. 


3 


555 


5 


BP332215 


BP332215 


BP332215 




89 


284 


12. 


2 


551 


2 


BF785590 


BF785590 


602112377 




90 


282.2 


12. 


2 


488 


6 


CB433209 


CB433209 


609534 MA 




91 


277.8 


12. 


0 


788 


2 


BF781367 


BF781367 


602104549 




92 


277.6 


12. 


0 


863 


7 


CF218052 


CF218052 


AGENCOURT 




93 


275.6 


11. 


9 


699 


2 


AW319008 


AW319008 


unl0f04.y 




94 


275.2 


11. 


9 


735 


4 


BI332326 


BI332326 


602983510 




95 


273.4 


11. 


8 


371^ 


2 


BE843851 


BE843851 


RC0-TN007 




96 


273.2 


11. 


8 


696 


2 


BB653699 


BB653699 


BB653699 




97 


271.8 


11. 


7 


766 


6 


CB600123 


CB600123 


AGENCOURT 




98 


271 


11. 


7 


904 


2 


BF780781 


BF780781 


602103058 




99 


270.4 


11. 


7 


654 


2 


AW318818 


AW318818 


un07hll.y 




100 


268 


11. 


6 


695 


2 


BB598307 


BB598307 


BB598307 




101 


267.8 


11. 


5 


1027 


4 


BG969791 


BG969791 


602838703 




102 


267.4 


11. 


5 


668 


2 


BB660895 


BB660895 


BB660895 




103 


266.6 


11. 


5 


758 


4 


BI332502 


BI332502 


602981312 




104 


264.4 


11. 


4 


782 


4 


BG972529 


BG972529 


602841114 




105 


261.8 


11. 


3 


602 


6 


CB601068 


CB601068 


AGENCOURT 




106 


261.6 


11. 


3 


380 


7 


CR746104 


CR746104 


CR746104 




107 


259.8 


11. 


2 


701 


2 


BF787378 


BF787378 


602113777 




108 


258.8 


11. 


2 


812 


4 


BG969879 


BG969879 


602838824 




109 


255.2 


11. 


0 


399 


1 


AA245658 


75LA245658 


mx03a06 . r 


c 


110 


254.6 


11. 


0 


328 


1 


AI671780 


AI671780 


wa05h05.x 


c. 


111 


254.6 


11. 


0 


331 


1 


AI613123 


. «. AI613123 


ty71h04.x 


c 


112 


254.6 


11. 


0 


332 


1 


AI 672388 


AI672388 


ty64f01.x 




113 


254 


10. 


9 


916 


2 


BF687218 


BF687218 


602102243 


c 


114 


253 


10. 


9 


923 


5 


BX694171 


BX694171 


BX694171 


c 


115 


252.6 


10. 


9' 


920 


5 


BX701766 


BX701766 


BX701766 


c 


116 


251.4 


10. 


8 


922 


5 


BX701212 


BX701212 


BX701212 




117 


247.8 


10. 


7 


647 


2 


BB637357 


BB637357 


BB637357 


c 


118 


247.8 


10. 


7 


922 


5 


BX686029 


BX686029 


BX686029 




119 


246.8 


10. 


6 


581 


1 


AI528034 


AI528034 


uj39bll.y 




120 


246.8 


10. 


6 


593 


1 


AI526529 


AI526529 


uj39el0.y 


c 


121 


246.6 


10. 


6 


345 


2 


AW779709 


AW779709 


hn84f05.x 




122 ■ 


246.6 


10. 


6 


383 


5 


BX955526 


BX955526 


DKFZp781A 





123 


246.2 


10.6 


878 


4 


BG973007 


BG973007 


602841631 




124 


244.2 


10.5 


473 


2 


BF659710 


BF659710 


maa23c04 . 


c 


125 


243.8 


10.5 


919 


5 


BX698025 


BX698025 


BX698025 




126 


239.4 


10.3 


651 


1 


AI527981 


AI527981 


uj38ell.y 




127 


236.8 


10.2 


551 


1 


AI876961 


AI876961 


uj36h03.y 




128 


236.4 


10.2 


550 


1 


AI526647 


AI526647 


uj41b09.y 




129 


236 


10.2 


581 


7 


CF171593 


CF171593 


B0844F01- 




130 


235.4 


10.1 


726 


6 


CB600351 


CB600351 


AGENCOURT 




131 


233 


10.0 


534 


1 


AI787373 


AI787373 


uj31b07.y 


c 


132 


231.2 


10.0 


891 


5 


BX687753 


BX687753 


BX687753 


c 


133 


230.8 


9.9 


347 


1 


AI767712 


AI767712 


wh38h05.x 




134 


228.8 


9.9 


533 


1 


AI875380 


AI875380 


uk31e01.y 




135 


228.4 


9.8 


549 


1 


AI050425 


AI050425 


uc86b01.y 


c 


136 


227.2 


9.8 


288 


2 


AW235540 


AW235540 


xnl9dl2.x 




137 


227.2 


9.8 


867 


2 


BF785547 


BF785547 


602112327 




138 


226.8 


9.8 


469 


2 


AW610950 


AW610950 


un74c08 . y 




139 


225.2 


9.7 


531 


4 


BI463768 


BI463768 


603203473 


c 


140 


225.2 


9.7 


891 


5 


BX699365 


BX699365 


BX699365 




141 


224.8 


9.7 


925 


2 


BF781667 


BF781667 


602104385 




142 


222.6 


9.6 


887 


2 


BF785369 


BF785369 


602111710 




143 


222.4 


9.6 


562 


1 


AI746748 


AI746748 


ul07a01.y 




144 


222.2 


9.6 


536 


1 


AI787361 


AI787361 


uj31a06.y 




145 


220 


9.5 


524 


1 


AI876741 


AI876741 


uj 35c01 . y 




146 


218.6 


9.4 


536 


2 


AW226792 


AW226792 


uin62c04 . y 




147 


217.8 


9.4 


638 


4 


BI100051 


BI100051 


602885109 




148 


217.6 


9.4 


517 


5 


BX512263 


BX512263 


BX512263 




149 


216.6 


9.3 


533 


1 


AI747909 


AI747909 


ul03h02.y 




150 


216.6 


9.3 


555 


2 


AW107039 


AW107039 


ul92b06.y 




151 


216.6 


9.3 


649 


2 


BF788330 


BF788330 


602114069 


c 


152 


216.4 


9.3 


291 


1 


AI521956 


AI521956 


ti79e06.x 




153 


215 


9.3 


407 


6 


CB434861 


CB434861 


611551 MA 




154 


214 


9.2 


715 


4 


BG971579 


BG971579 


602840635 


c 


155 


213.8 


9.2 


813 


5 


BX693887 


BX693887 


BX693887 




156 


213 


9.2 


546 


2 


AW475287 


AW475287 


un63h08.y 


c 


157 


212.2 


9.1 


793 


7 


CF219588 


CF219588 


AGENCOURT 


c 


158 


211.8 


9.1 


796 


7 


CF218053 


CF218053 


AGENCOURT 




159 


211.4 


9.1 


769 


2 


BF782853 


BF782853 


602107722 




160 


210.6 


9.1 


611 


1 


AL794601 


AL794601 


AL794601 




161 


208.8 


9.0 


515 


2 


BF532434 


BF532434 


602074477 




162 


208.4 


9.0 


1349 


2 


BF686940 


BF686940 


602102714 




163 


207.8 


9.0 


• 968 


2 


BF780880 


fiF780880 


602105777 




164 


206.8 


8.9 


931 


7 


CF238676 


CF238676 


AGENCOURT 




165 


204.6 


8.8 


661 


6 


BY732658 


BY732658 


BY732658 




166 


202.4 


8.7 


489 


5 


BX522348 


BX522348 


BX522348 


c 


167 


197 


8.5 


253 


1 


AI373521 


AI373521 


qz46f08.x 


c 


168 


197 


. 8.5 


253 


1 


AI492912 


AI492912 


.qz.42f01.x 




169 


191.6 


8.3 


833 


2 


BF786313 


BF786313 


602112827 




170 


191.4 


8.2 


508 


1 


AL803540 


AL803540 


AL803540 


c 


171 


191.4 


8.2 


721 


4 


BJ640903 


BJ640903 


BJ640903 




172 


187.6 


8.1 


958 


2 


BF782294 


BF782294 


602106080 




173 


187.2 


8.1 


488 


2 


AW105948 


AW105948 


uin20c05.y 




174 


186.8 


8.1 


752 


6 


CB601161 


CB601161 


AGENCOURT 




175 


186 


8.0 


470 


2 


AW319385 


AW319385 


unl5f02.y 




176 


184.6 


8.0 


1783 


3 


CR636858 


CR636858 


Tetraodon 


c 


177 


183.2 


7.9 


361 


1 


AA730192 


AA730192 


nw41a02, s 




178 


183.2 


7.9 


375 


1 


AA715548 


AA715548 


nv53gll . r 




179 


183 


7.9 


758 


4 


BG969488 


BG969488 


602837021 





180 


179.8 


7.8 


478 


1 


AI874790 


AI874790 


ul28b05.y 




181 


179.4 


7.7 


389 


5 


BY105462 


BY105462 


BY105462 . 




182 


179 


7.7 


820 


7 


CF222788 


CF222788 


AGENCOURT 


c 


183 


177. 8 


7.7 


676 


4 


BJ641175 


BJ641175 


BJ64117S 


c 


184 


173.4 


7.5 


606 


9 


CE149561 


CE149561 


tigr-gss- 


c 


185 


172.2 


7.4 


603 


9 


CE843398 


CE843398 


tigr-gss- 


c 


186 


171.8 


7.4 


599 


1 


AA758115 


AA758115 


ah68d01.s 




187 


168.8 


7.3 


420 


2 


BB846612 


BB846612 


BB846612 




188 


168.8 


7.3 


831 


7 


CF219551 


CF219551 


AGENCOURT 


c 


189 


168.4 


7.3 


673 


1 


AI746432 


AI746432 


ul07a01.x 




190 


168.2 


7.2 


443 


1 


AI182048 


AI182048 


ud73d07.y 




191 


167.4 


7.2 


209 


7 


CR746121 


CR746121 


CR746121 




192 


166.6 


7.2 


463 


1 


AI931234 


AI931234 


ul58el0.y 


c 


193 


164 


7.1 


791 


7 


CF219552 


CF219552 


AGENCOURT 




194 


163.6 


7.1 


882 


7 


CF239400 


CF239400 


AGENCOURT 




195 


159.6 


6.9 


706 


7 


CN059643 


CN059643 


Salamande 


c 


196 


159.2 


6.9 


301 


1 


AI891486 


AI891486 


ul59b08.x 




197 


158.6 


6.8 


896 


6 


CA973829 


CA973829 


AGENCOURT 




198 


158.4 


6.8 


542 


4 


BI218377 


BI218377 


602937760 




199 


158.4 


6.8 


594 


8 


AZ519016 


AZ519016 


RPCI-11-6 




200 


156.8 


6.8 


464 


8 


AQ199079 


AQ199079 


RPCIll-67 




201 


156.2 


6.7 


771 


7 


CF222154 


CF222154 


AGENCOURT 




202 


155.8 


6.7 


426 


1 


AI529289 


AI529289 


ui61fl2.y 




203 


153 


6.6 


434 


2 


BB847184 


BB847184 


BB847184 




204 


152.4 


6.6 


428 


2 


BB847987 


BB847987 


BB847987 




205 


151.6 


6.5 


936 


6 


CA980884 


CA980884 


AGENCOURT 




206 


151.4 


6.5 


387 


1 


AA105944 


AA105944 


ml80g04.r 




207 


151 


6.5 


1946 


3 


AK050435 


AK050435 


Mus muscu 


c 


208 


150 


6.5 


607 


4 


BJ631992 


BJ631992 


BJ631992 




209 


149.4 


6.4 


2573 


3 


AK004971 


AK004971 


Mus muscu 




210 


148.2 


6.4 


438 


2 


BB848332 


BB848332 


BB848332 




211 


147.8 


6.4 


1581 


9 


AY418358 


AY418358 


Mus muscu 




212 


147.6 


6.4 


426 


2 


BB847742 


BB847742 


BB847742 




213 


147.6 


6.4 


858 


7 


CN986339 


CN986339 


61948 125 


c 


214 


147 


6.3 


651 


5 


BQ388289 


BQ388289 


NISC_mn28 




215 


146.4 


6.3 


827 


7 


CF219587 


CF219587 


AGENCOURT 




216 


146.2 


6.3 


426 


2 


BB846847 


BB846847 


BB846847 




217 


145.8 


6.3 


418 


5 


BY006680 


BY006680 


BY006680 




218 


145.6 


6.3 


838 


5 


BU899699 


BU899699 


AGENCOURT 




219 


144.8 


6.2 


424 


2 


BB848230 


BB848230 


BB848230 




220 


144.6 


6.2 


429 


2 


BB847232 


BB847232 


BB847232 




221 


144.2 


6.2 


493 


1 


AI746700 


AI74(5700 


ul06d06.y 


c 


222 


144.2 


6.2 


644 


4 


BJ098498 


BJ098498 


BJ098498 




223 


143.6 


6.2 


437 


1 


AI891771 


AI891771 


ul59b08.y 


c 


224 


143.4 


6.2 


1108 


7 


C0645645 


C0645645 


ILLUMIGEN 




225 


142 


6.1 


438 


2 


BB847387 


BB847387 


BB847387 




226 


141.4 


6.1 


430 


1 


AL778797 


AL778797 


AL778797 




227 


140.8 


6.1 


369 


2 


BB843421 


BB843421 


BB843421 




228 


140.6 


6.1 


933 


7 


C0775326 


C0775326 


ILLUMIGEN 




229 


140.2 


6.0 


1896 


3 


AK002736 


AK002736 


Mus muscu 




230 


138.8 


6.0 


418 


2 


BB847984 


BB847984 


BB847984 




231 


138 '. 6 


6.0 


362 


2 


BB847614 


BB847614 


BB847614 




232 


138.6 


6.0 


370 


2 


BB843440 


BB843440 


BB843440 




233 


138.2 


6.0 


528 


1 


AI876859 


AI876859 


uj35all.y 




234 


137.8 


5.9 


1892 


3 


AK034801 


AK034801 


Mus muscu 




235 


137.4 


5.9 


2575 


3 


BC048920 


BC048920 


Mus muscu 




236 


136 


5.9 


960 


5 


BQ713091 


BQ713091 


AGENCOURT 





237 


133.8 


5.8 


2895 


3 


AK083294 




238 


132.6 


5.7 


825 


5 


BP443785 




239 


132 


5.7 


1118 


7 


CK028589 




240 


131.6 


5.7 


1642 


6 


CD013996 




241 


131.4 


5.7 


927 


5 


BQ925596 




242 


130.8 


5.6 


1004 


5 


BQ942104 




243 


130.4 


5.6 


718 


4 


BJ038707 




244 


130.2 


5.6 


928 


7 


CK871071 




245 


130 


5.6 


821 


7 


CK953540 




246 


129.8 


5.6 


420 


5 


BY006031 




247 


129.8 


5.6 


432 


2 


BB846687 


c 


248 


129.4 


5.6 


740 


4 


BJ640531 




249 


129.2 


5.6 


424 


2 


BB846725 




250 


129.2 


5.6 


599 


4 


BI387131 




251 


129.2 


5.6 


631 


4 


BI387132 




252 


129 


5.6 


818 


2 


BF687167 




253 


128.2 


5.5 


1581 


9 


AY418356 




254 


127.4 


5.5 


425 


2 


BB844689 




255 


127.2 


5.5 


383 


5 


BY005749 




256 


127.2 


5.5 


415 


2 


BB847175 




257 


124.8 


5.4 


430 


5 


BY006688 




258 


124.8 


5.4 


1023 


6 


CD013994 




259 


124.8 


5.4 


1336 


6 


CD013995 




260 


124.6 


5.4 


735 


7 


CK949530 




261 


124 


5.3 


1783 


3 


AK050327 




262 


123.6 


5.3 


338 


2 


BB846932 




263 


123 


5.3 


930 


5 


BQ934372 




264 


123 


5.3 


1957 


3 


CR671108 




265 


122.8 


5.3 


423 


5 


BY006690 




266 


122.6 


5.3 


825 


5 


BU240452 




267 


122.4 


5.3 


830 


6 


CA493727 




268 


122 


5.3 


977 


7 


CO579097 


c 


269 


121.6 


5.2 


527 


8 


AQ993185 


c 


270 


121.6 


5.2 


694 


8 


AZ085726 




271 


121.6 


5.2 


810 


7 


CO574370 




272 


121.6 


5.2 


843 


7 


C0558457 


c 


273 


121.2 


5.2 


567 


4 


BI443347 




274 


121 


5.2 


340 


2 


BB846937 


c 


275 


120.8 


5.2 


405 


1 


AI433413 




276 


120.8 


5.2 


2099 


3 


AK008601 




277 


120.4 


5.2 


596 


1 


AL781524 




278 


120.4. 


5.2 


651 


1 


AA255327 




279 


119.8 


5.2 


1130 


3 


CR646329 


c 


280 


119.6 


5.2 


594 


9 


FR0049565 




281 


119.2 


5.1 


737 


7 


C0562727 




282 


..^U19 


5.1 


795 


5 


BX880311 




283 


119 


5.1 


835 


7 


C0573273 




284 


119 


5.1 


882 


4 


BI144244 




285 


118.8 


5.1 


494 


1 


AI785732 




286 


118.8 


5.1 


629 


5 


BP138260 


c 


287 


118 


5.1 


148 


2 


BF089587 




288 


117.6 


5.1 


2450 


3 


BC011409 




289 


117.4 


5.1 


768 


5 


BX876854 


c 


290 


117.2 


5.1 


1067 


9 


CNS05HQH 




291 


116.6 


5.0 


738 


6 


CB418995 




292 


115.4 


5.0 


619 


9 


FR0007657 




293 


115.4 


5.0 


962 


4 


BG969620 



AK083294 Mus muscu 
BP443785 BP443785 
CK028589 AGENCOURT 
CD013996 90117309 
BQ925596 AGENCOURT 
BQ942104 AGENCOURT 
BJ038707 BJ038707 
CK871071 AGENCOURT 
CK953540 4093245 B 
BY006031 BY006031 
BB846687 BB846687 
BJ640531 BJ640531 
BB846725 BB846725 
BI387131 BFL26_001 
BI387132 BFL26_001 
BF687167 602102982 
AY4 18356 Homo sapi 
BB844689 BB844689 
BY005749 BY005749 
BB847175 BB847175 
BY006688 BY006688 
CD013994 90130114 
CD013995 90130122 
CK949530 4074896 B 
AK050327 Mus muscu 
BB846932 BB846932 
BQ934372 AGENCOURT 
CR671108 Tetraodon 
BY006690 BY006690 
BU240452 603323679 
CA4 93727 AGENCOURT 
CO57 9097 ILLUMIGEN 
AQ993185 RPCI-23-3 
AZ085726 RPCI-23-6 
CO574370 AGENCOURT 
C0558457 AGENCOURT 
BI443347 dai90a05. 
BB846937 BB846937 
AI433413 ti65c03.x 
AK008601 Mus muscu 
AL781524 AL781524 
AA255327 mz84fl0.r 
CR646329 Tetraodon 
AL605374 Fugu rubr 
C0562727 AGENCOURT 
BX8.80311 BX880311 
C0573273 AGENCOURT 
BI144244 602907984 
AI785732 uj37h01.y 
BP138260 BP138260 
BF089587 CM2-HT094 
BC011409 Homo sapi 
BX876854 BX876854 
AL337922 Tetraodon 
CB418995 591831 MA 
Z91467 F.rubripes 
BG969620 602836951 



294 


115 


5. 


0 


615 


1 


AL803499 


AL803499 


AL803499 


295 


115 


5. 


0 


1030 


5 


BQ714919. 


. BQ714919 


AGENCOURT 


296 


114.4 


4. 


9 


704 


6 


CB518006 


CB518006 


ssalrgb53 


297 


114 


4. 


9 


2079 


6 


CD013998 


CD013998 


90117389 


298 


113.8 


4. 


9 


634 


6 


CB955397 


CB955397 


AGENCOURT 


299 


113.8 


4. 


9 


1005 


4 


BG167171 


BG167171 


602344696 


300 


113.4 


4. 


9 


618 


1 


AL787677 


AL787677 


AL787677 


301 


113.4 


4. 


9 


633 


1 


AL789572 


AL789572 


AL789572 


302 


113.4 


4. 


9 


759 


4 


BJ730921 


BJ730921 


BJ730921 


303 


113 


4. 


9 


677 


7 


CK981590 


CK981590 


4114067 B 


304 


113 


4. 


9 


1570 


3 


CR646752 


CR646752 


Tetraodon 


305 


112.8 


4. 


9 


685 


4 


BJ495916 


BJ495916 


BJ495916 


306 


112.4 


4. 


8 


774 


7 


C0423341 


C0423341 


GGEZHTIOO 


307 


111.6 


4. 


8 


513 


2 


BF105272 


BF105272 


601822259 


308 


111.4 


4. 


8 


606 


6 


CB581397 


CB581397 


AMGNNUC : C 


309 


111.4 


4. 


8 


2394 


3 


BC043491 


BC043491 


Homo sapi 


310 


111 


4. 


8 


1126 


7 


CO645680 


CO645680 


ILLUMIGEN 


311 


110.8 


4. 


8 


605 


6 


CB581581 


CB581581 


AMGNNUC :N 


312 


110.6 


4. 


8 


977 


4 


BI331895 


BI331895 


602982580 


313 


110.4 


4. 


8 


691 


8 


AZ614547 


AZ614547 


1M0443J19 


314 


110.4 


4. 


8 


1770 


3 


CR644097 


CR644097 


Tetraodon 


315 


109.4 


4. 


7 


510 


6 


CB286305 


CB286305 


CMD34_E11 


316 


109.2 


4. 


7 


410 


2 


AW226896 


AW226896 


um63b09.y 


317 


109.2 


4. 


7 


848 


7 


CN986271 


CN986271 


61879 125 


318 


108.4 


4. 


7 


867 


7 


CN172607 


CN172607 


AGENCOURT 


319 


107.8 


4. 


6 


514 


6 


CB286306 


CB286306 


CMD34__E12 


320 


107.4 


4. 


6 


673 


6 


CA042007 


CA042007 


ssalplnbS 


321 


107.2 


4. 


6 


685 


5 


BU136075 


BU136075 


603124201 


322 


107.2 


4. 


6 


839 


7 


CF249906 


CF249906 


esa006 aO 


323 


107.2 


4. 


6 


861 


7 


CN159685 


CN159685 


'948778 MA 


324 


107.2 


4. 


6 


970 


5 


BU122351 


BU122351 


603146762 


325 


107.2 


4. 


6 


988 


5 


BU122681 


BU122681 


603149059 


326 


106.8 


4. 


6 


584 


1 


AL793872 


AL793872 


AL793872 


327 


106.2 


4. 


6 


840 


4 


BJ746350 


BJ746350 


BJ746350 


328 


106.2 


4. 


6 


888 


9 


AY418357 


AY418357 


Pan trogl 


329 


106 


4. 


6 


674 


7 


CK948396 


CK948396 


4073279 B 


330 


105.8 


4. 


6 


580 


8 


AQ059652 


AQ059652 


CIT-HSP-2 


331 


105.4 


4. 


5 


751 


5 


BP452350 


BP452350 


BP452350 


332 


104.8 


4. 


5 


700 


6 


CB497940 


CB497940 


omykrbhbO 


333 


104.6 


4 . 


5 


383 


2 


AW107617 


AW107617 


ul93c02.y 


334 


104.6 


4. 


5 


653 


6 


CD309485 


CD309485 


StrPu691. 


335 


104.4 


4. 


5 


677 


7 


CK956169 


CK956169 


4096235 B 


336 


103.8 


4. 


5 


835 


4 


BI757311 


BI757311 


603029525 


337 


103.6 


4. 


5 


613 


1 


AL679732 


AL679732 


AL679732 


338 


103.4 


4. 


5 


652 


6 


CA343408 


CA343408 


673504 NC 


339 


102.8 


4. 


4 


1188 


6^ 


,-CD505366 


CD505366 


CDA73-C09 


340 


102.6 


4. 


4 


651 


4 


BJ094864 


BJ094864 


BJ094864 


341 


102 


4. 


4 


673 


4 


BJ640772 


BJ640772 


BJ640772 


342 


101.8 


4. 


4 


310 


5 


BX636608 


BX636608 


BX636608 


343 


101.8 


4. 


4 


701 


1 


AL133897 


AL133897 


DKFZp761K 


344 


101.6 


4. 


4 


492 


4 


BG970195 


BG970195 


602839368 


345 


101. '6 


4. 


4 


2202 


3 


AK002629 


AK002629 


Mus muscu 


346 


101.4 


4. 


4 


781 


1 


AI529761 


AI529761 


ui82b06.y 


347 


101 


4. 


4 


652 


4 


BJ624350 


BJ624350 


BJ624350 


348 


101 


4. 


4 


879 


7 


C0774398 


C0774398 


ILLUMIGEN 


349 


101 


4. 


4 


1101 


9 


CNS059KU 


AL327351 


Tetraodon 


350 


100.8 


4. 


3 


315 


1 


AA104237 


AA104237 


rapOlbOl.r 





351 


100.8 


4.3 


588 


5 


BP448488 


BP448488 


BP448488 




352 


100.8 


4.3 


615 


5 


BP449052 


BP449052 


BP449052 




353 


100.8 


4.3 


794 


4 


BG575052 


BG575052 


602598159 




354 


100.8 


4.3 


913 


7 


C0579137 


C0579137 


ILLUMIGEN 




355 


100 


4.3 


974 


7 


C0579983 


C0579983 


ILLUMIGEN 




356 


100 


4.3 


996 


5 


BQ714776 


BQ714776 


AGENCOURT 




357 


99 


4.3 


617 


4 


BJ615347 


BJ615347 


BJ615347 




358 


98.6 


4.2 


708 


4 


BJ731220 


BJ731220 


BJ731220 




359 


98.6 


4.2 


708 


7 


CV527312 


CV527312 


CS GIL 14 




360 


98.6 


4.2 


750 


5 


BP448650 


BP448650 


BP448650 




361 


98.6 


4.2 


1022 


5 


BM925650 


BM925650 


AGENCOURT 




362 


98.4 


4.2 


564 


6 


CA347839 


CA347839 


679020 NC 




363 


98.4 


4.2 


2306 


3 


BC012716 


BC012716 


Mus muscu 




364 


98.2 


4.2 


759 


4 


BI330877 


BI330877 


602981265 




365 


98.2 


4.2 


790 


7 


C0555662 


C0555662 


AGENCOURT 


c 


366 


98.2 


4.2 


808 


4 


BJ716530 


BJ716530 


BJ716530 


c 


367 


98 


4.2 


834 


4 


BJ746664 


BJ746664 


BJ746664 




368 


97.8 


4.2 


710 


2 


BB667498 


BB667498 


BB667498 


c 


369 


97.6 


4.2 


761 


1 


AI118428 


AI118428 


ue40e09.x 




370 


97.4 


4.2 


656 


5 


BQ388290 


BQ388290 


NISC_itin28 




371 


97.4 


4.2 


728 


6 


CD493117 


CD493117 


CDA03-C12 




372 


97.4 


4.2 


807 


5 


BU901536 


BU901536 


AGENCOURT 




373 


97.4 


4.2 


921 


7 


CN065626 


CN065626 


D15_Ag2_p 




374 


97.2 


4.2 


697 


2 


BB667336 


BB667336 


BB667336 




375 


97.2 


4.2 


697 


4 


BG747143 


BG747143 


602704364 




376 


97 


4.2 


874 


7 


CN985398 


CN985398 


58371 126 




377 


96.6 


4.2 


581 


1 


AL791968 


AL791968 


AL791968 


c 


378 


96.6 


4.2 


639 


2 


BF322562 


BF322562 


maa28el2. 




379 


96.4 


4.2 


654 


4 


BJ060547 


BJ060547 


BJ060547 


c 


380 


96.4 


4.2 


861 


7 


CK974452 


CK974452 


4105352 B 




381 


96.4 


4.2 


871 


5 


BQ900474 


BQ900474 


AGENCOURT 




382 


96.4 


4.2 


913 


5 


BQ714368 


BQ714368 


AGENCOURT 




383 


96.4 


4.2 


938 


5 


BQ942682 


BQ942682 


AGENCOURT 




384 


96 


4.1 


643 


7 


CV222277 


CV222277 


Le mxO 58 


c 


385 • 


96 


4.1 


896 


7 


CN159120 


CN159120 


948155 MA 




386 


95.6 


4.1 


603 


6 


CB163625 


CB163625 


K-EST0224 




387 


95.6 


4.1 


662 


1 


AL792519 


AL792519 


AL792519 




388 


95.6 


4.1 


665 


1 


7\L638480 


AL638480 


AL638480 




389 


95.4 


4.1 


573 


1 


AL789331 


AL789331 


AL789331 




390 


95.4 


4.1 


1131 


2 


BF687385 


BF687385 


602102551 




391 


95.2 


4.1 


611 


7 


CF250034 


CF250034 


esa007 fO 




392 


95.2 


4.1 


1005 


2 


BE870911 


BE870911 


601448862 




393 


95 


4.1 


430 


2 


BB847324 


BB847324 


BB847324 


c 


394 


95 


4.1 


739 


7 


C0387359 


C0387359 


AGENCOURT 




395 


94.8 


4.1 


557 


2 


AW318536 


AW318536 


um98f lO.y 




396 


94.6 


4.1 


573 


7 


C0298247 


C0298247 


EK172617. 


c 


397 


94.6 


4.1 


598 


1 


AI303476 


AI303476 


ui71e01.x 


c 


398 


94.4 


4.1 


555 


1 


AI265736 


AI265736 


uj05fl2.x 




399 


94.4 


4.1 


934 


2 


BF786200 


BF786200 


602110490 




400 


94.2 


4.1 


415 


1 


AI959352 


AI959352 


fd08e07.y 




401 


94.2 


4.1 


561 


7 


CN227962 


CN227962 


RJB035D05 




402 


94.2 


4.1 


582 


6 


CA386855 


CA386855 


668573 NC 




403 


94.2 


4.1 


650 


7 


CV041502 


CV041502 


4139371 B 




404 


94.2 


4.1 


666 


6 


CA348546 


CA348546 


679900 NC 




405 


94.2 


4.1 


668 


6 


CA349227 


CA349227 


619554 NC 




406 


93.8 


4.0 


415 


5 


BY006441 


BY006441 


BY006441 




407 


93.8 


4.0 


424 


2 


BB844926 


BB844926 


BB844926 





408 


93.8 


4.0 


554 


2 


BE013295 


BE013295 


123199 MA 




409 


93.8 


4.0 


1192 


2 


BF789664 


BF789664 


602105279 




410 


93.6 


4.0 


718 


5 


BX876842 


BX876842 


BX876842 




411 


93.6 


4.0 


724 


5 


BX875102 


BX875102 


BX875102 




412 


93.6 


4.0 


755 


5 


BX889578 


BX889578 


BX889578 




413 


93.6 


4.0 


759 


5 


BX885634 


BX885634 


BX885634 




414 


93 


4.0 


690 


8 


AZ614319 


AZ614319 


1M0443N05 


c 


415 


93 


4.0 


728 


7 


CN585956 


CN585956 


USDA-FP 1 




416 


93 


4.0 


746 


2 


BF688309 


BF688309 


602185172 


c 


417 


93 


4.0 


822 


7 


CN761092 


CN761092 


ID0AAA2BE 




418 


92.8 


4.0 


581 


5 


BP275541 


BP275541 


BP275541 




419 


92.8 


4.0 


753 


7 


CK029964 


CK029964 


AGENCOURT 




420 


92.8 


4.0 


111 


7 


C0573842 


C0573842 


AGENCOURT 




421 


92.6 


4.0 


547 


7 


CR751755 


• CR751755 


DKFZp469C 


c 


422 


92.6 


4.0 


llA 


7 


CF343072 


CF343072 


AGENCOURT 




423 


92.4 


4.0 


616 


6 


CB163341 


CB163341 


K-EST0224 




424 


91.6 


3.9 


356 


5 


BY005600 


BY005600 


BY005600 




425 


91.6 


3.9 


776 


4 


BI103442 


BI103442 


602888971 




426 


91.2 


3.9 


572 


2 


AW917402 


AW917402 


EST348706 




427 


91.2 


3.9 


722 


6 


CB951410 


CB951410 


AGENCOURT 




428 


91 


3.9 


535 


1 


AL911920 


AL911920 


AL911920 


c 


429 


91 


3.9 


551 


4 


BJ021349 


BJ021349 


BJ021349 




430 


91 


3.9 


606 


4 


BJ009417 


BJ009417 


BJ009417 


c 


431 


91 


3.9 


674 


4 


BJ020682 


BJ020682 


BJ020682 


c 


432 


91 


3.9 


687 


4 


BJ019313 


BJ019313 


BJ019313 


c 


433 


91 


3.9 


733 


4 


BJ516248 


BJ516248 


BJ516248 


c 


434 


91 


3.9 


753 


4 


BJ518554 


BJ518554 


BJ518554 


c 


435 


91 


3.9 


756 


4 


BJ533312 


BJ533312 


BJ533312 




436 


90.8 


3.9 


667 


7 


CK947903 


CK947903 


4072727 B 




437 


90.8 


3.9 


670 


7 


CN793669 


CN793669 


4128731 B 




438 


90.8 


3.9 


732 


7 


CK967438 


CK967438 


4082607 B 




439 


90.6 


3.9 


599 


4 


BJ493042 


BJ493042 


BJ493042 




440 


90.6 


3.9 


616 


4 


BJ501742 


BJ501742 


BJ501742 




441 


90.6 


3.9 


910 


7 


CN992653 


CN992653 


70709_125 


c 


442 


90.4 


3.9 


732 


3 


CNS08LE2 


BX017142 


Single re 


c 


443 


90.2 


3.9 


811 


6 


CD632277 


CD632277 


56066634J 


c 


444 


90 


3.9 


643 


1 


AI785430 


AI785430 


uj42b06.x 


c 


445 


90 


3.9 


725 


4 


BJ518275 


BJ518275 


BJ518275 




446 


89.8 


3.9 


958 


4 


BG167110 


BG167110 


602344612 


c 


447 


89.8 


3.9 


1125 


9 


CNS03SJZ 


AL258632 


Tetraodon 




448 


89.6 


3.9 


969 


4 


BG291839 


BG291839 


602386068 




449 


89.4 


3.9 


904 


5 


BU121683 


BU121683 


603145755 




450 


89.4 


3.9 


905 


7 


CN173591 


CN173591 


AGENCOURT 


c 


451 


89.2 


3.8 


674 


7 


CO704930 


CO704930 


DG32-292d 


c 


452 


89 


3.8 


127 


2 


BF514925 


BF514925 


UI-H-BWl- 




453 


88.8 


.....3 . 8 


624 


7 


CN275923 


CN275923 


.,17.0004245 




454 


88.8 


3.8 


975 


2 


BF533975 


BF533975 


602075137 


c 


455 


88.4 


3.8 


602 


3 


CNS08M60 


BX018148 


Single re 




456 


88.2 


3.8 


398 


5 


BY006400 


BY006400 


BY006400 


c 


457 


88.2 


3.8 


823 


2 


BF689099 


BF689099 


602185172 




458 


88.2 


3.8 


831 


6 


CD105882 


CD105882 


AGENCOURT 




459 


88 


' 3.8 


812 


4 


BI221130 


BI221130 


602939634 




460 


87.8 


3.8 


618 


9 


FR0007653 


Z91463 F.rubripes 




461 


87.8 


3.8 


891 


4 


BG189133 


BG189133 


RST8173 A 




462 


87.4 


3.8 


510 


5 


BX483530 


BX483530 


DKFZp686C 




463 


87.4 


3.8 


1201 


9 


CNS015XW 


AL106046 


Drosophil 




464 


87.2 


3.8 


579 


5 


BP277054 


BP277054 


BP277054 





465 


87 


3.8 


812 


5 


BU364284 


BU364284 


603584620 




466 


86.8 


3.7 


^ 447 


5 


BX618498 


BX618498 


BX618498 


c 


467 


86.8 


3.7 


615 


7 


CK133545 


CK133545 


RE29447.3 




468 


86.4 


3.7 


830 


4 


BG195984 


BG195984 


RST15067 




469 


86.2 


3.7 


550 


1 


AI097692 


AI097692 


ue35bll.y 


c 


470 


86.2 


3.7 


658 


6 


CB503528 


CB503528 


ssalmgeSO 




471 


86 


3.7 


794 


4 


BI328333 


BI328333 


602985653 


c 


472 


85.6 


3.7 


548 


7 


C0698361 


C0698361 


DG32-135b 


c 


473 


85.4 


3.7 


618 


9 


FR0049525 


AL605334 


Fugu rubr 




474 


85.2 


3.7 


529 


6 


CB224174 


CB224174 


1JEJ32C4 




475 


85 


3.7 


1044 


9 


CNS044X4 


> AL274657 


Tetraodon 


c 


476 


84.8 


3.7 


659 


7 


CK132607 


CK132607 


RE03265.3 




477 


84.6 


3.6 


688 


7 


CK977467 


CK977467 


4108686 B 




478 


84.4 


3.6 


416 


7 


CF919654 


CF919654 


Bflor531. 




479 


84.4 


3.6 


455 


4 


BI387360 


BI387360 


BFL26_002 




480 


84.4 


3.6 


551 


4 


BI442788 


BI442788 


dai90a05. 




481 


84.4 


3.6 


702 


5 


BU038648 


BU038648 


DH02E04 H 




482 


84.2 


3.6 


581 


5 


BP360001 


BP360001 


BP360001 




483 


84.2 


3.6 


788 


5 


BX624452 


BX624452 


BX624452 




484 


84 


3.6 


303 


7 


CR738701 


CR738701 


CR738701 


c 


485 


84 


3.6 


607 


7 


CK132585 


CK132585 


RE02574.3 




486 


83.8 


3.6 


428 


5 


BY006488 


BY006488 


BY006488 




487 


83.6 


3.6 


840 


5 


BX624575 


BX624575 


BX624575 


c 


488 


83.2 


3.6 


708 


5 


BQ202033 


BQ202033 


UI-R-EBl- 




489 


83 


3.6 


660 


6 


CD703965 


CD703965 


EST20492 




490 


83 


3.6 


1068 


5 


BU145928 


BU145928 


AGENCOURT 




491 


82.6 


3.6 


728 


4 


BG569536 


BG569536 


602588811 




492 


82.4 


3.6 


736 


7 


CK950173 


CK950173 


4075687 B 




493 


82.2 


3.5 


625 


7 


CK951591 


CK951591 


4090903 B 




494 


82.2 


3.5 


1016 


9 


CNS03VYT 


AL263054 


Tetraodon 




495 


81.6 


3.5 


666 


4 


BG971005 


BG971005 


602840033 




496 


81.4 


3.5 


557 


2 


BF080461 


BF080461 


231360 MA 


c 


497 


81.2 


3.5 


656 


2 


AW338024 


AW338024 


xw65c05.x 


c 


498 


81.2 


3.5 


1080 


9 


CNS03FCV 


AL241528 


Tetraodon 




499 


81 


3.5 


551 


6 


CD331938 


CD331938 


StrPu537. 


c 


500 


81 


3.5 


638 


7 


CK950104 


CK950104 


4075303 B 




501 


80.8 


3.5 


789 


7 


CO808016 


CO808016 


AGENCOURT 




502 


80.4 


3.5 


863 


7 


CK974673 


CK974673 


4105736 B 




503 


80.2 


3.5 


1100 


5 


BQ278891 


BQ278891 


AGENCOURT 




504 


79.6 


3.4 


578 


7 


CK888307 


CK888307 


SGP149514 




505 


79.4 


3.4 


696 


4 


BG971909 


BG971909 


602841666 




506 


79.4 


3.4 


886 


4 


BG402016 


BG402016 


602466734 




507 


79.2 


3.4 


651 


7 


C0259652 


C0259652 


4131159 B 




508 


79.2 


3.4 


842 


7 


C0246322 


C0246322 


AGENCOURT 


c 


509 


79 


3.4 


514 


1 


AI132717 


AI132717 


ue33h09.x 




510 


79 


3.4 


707 


7 


C0641.589 


C0641589 


USDA-FP 1 




511 


78.8 


3.4 


452 


4 


BM090021 


BM090021 


503910 MA 




512 


78.8 


3.4 


642 


6 


CB155303 


CB155303 


K-EST0213 




513 


78.8 


3.4 


759 


7 


CN275924 


CN275924 


170006008 




514 


78.6 


3.4 


328 


5 


BY005676 


BY005676 


BY005676 




515 


78.6 


3.4 


823 


4 


BG193837 


BG193837 


RST12973 




516 


78 


3.4 


801 


4 


BG212458 


BG212458 


RST32045 




517 


77.8 


3.4 


449 


2 


BF080415 


BF080415 


231292 MA 


c 


518 


77.8 


3.4 


614' 


6 


CB169862 


CB169862 


KSV603161 




519 


77.6 


3.3 


441 


9 


CNS07H5L 


AL610635 


Anopheles 


c 


520 


77.6 


3.3 


589 


1 


AI256545 


AI256545 


ui87g04.x 


c 


521 


77.6 


3.3 


592 


7 


CK131146 


CK131146 


GH06505.3 





522 


77.2 


3.3 


425 


4 


BJ624661 


BJ624661 


BJ624661 




523 


77 


3.3 


1297 


3 


AK078371 


AK078371 


Mus muscu 


c 


524 


76.6 


3.3 


538 


6 


CD336301 


CD336301 


StrPu537. 


c 


525 


76.6 


3.3 


616 


4 


BJ507779 


BJ507779 


BJ507779 




526 


76.6 


3.3 


1107 


5 


BM924331 


BM924331 


AGENCOURT 




527 


75.8 


3.3 


1118 


2 


BF582087 


BF582087 


602099271 




528 


75.6 


3.3 


414 


7 


C0994585 


C0994585 


UMC-pdl2 


c 


529 


75.6 


3.3 


584 


4 


BJ023549 


BJ023549 


BJ023549 




530 


75.6 


3.3 


652 


4 


BJ705583 


BJ705583 


BJ705583 




531 


75.6 


3.3 


739 


7 


CO047895 


CO047895 


Lr ATICF 




532 


75.4 


3.2 


935 


4 


BG427877 


BG427877 


602501564 




533 


75.2 


3.2 


792 


2 


BF786091 


BF786091 


602110662 




534 


75.2 


3.2 


1110 


2 


BF533824 


BF533824 


602075352 




535 


75 


3.2 


568 


1 


AA212451 


AA212451 


mw76gl2 . r 




536 


75 


3.2 


648 


7 


CN983881 


CN983881 


52591 126 


c 


537 


75 


3.2 


732 


4 


BJ743004 


BJ743004 


BJ743004 




538 


74.8 


3.2 


616 


1 


AA255225 


AA255225 


mz85d04 . r 




539 


74.6 


3.2 


494 


4 


BJ623942 


BJ623942 


BJ623942 




540 


74.6 


3.2 


557 


6 


CB157680 


CB157680 


K-EST0216 




541 


74.6 


3.2 


770 


4 


BG201991 


BG201991 


RST21337 




542 


74.2 


3.2 


694 


1 


AU129743 


AU129743 


AU129743 




543 


74.2 


3.2 


782 


4 


BG167429 


BG167429 


602342703 




544 


74.2 


3.2 


812 


1 


AI663013 


AI663013 


uj69fl0.y 


c 


545 


74.2 


3.2 


915 


3 


CNS08R35 


BX024525 


Single re 




546 


■ 74 


3.2 


582 


1 


AA075453 


AA075453 


zm87e07.r 




547 


73.8 


3.2 


778 


7 


CF250901 


CF250901 


esa018_e0 




548 


73.6 


3.2 


136 


1 


AA073542 


AA073542 


inm94b03.r 




549 


73.2 


3.2 


508 


2 


BF383473 


BF383473 


602045154 




550 


73.2 


3.2 


599 


4 


BJ488432 


BJ488432 


BJ488432 




551 


73.2 


3.2 


751 


7 


CR769273 


CR769273 


DKFZp469I 




552 


73 


3.1 


533 


2 


BF131143 


BF131143 


601819955 




553 


73 


3.1 


573 


6 


CB161428 


CB161428 


K-ESTQ221 




554 


73 


3.1 


837 


7 


CN763566 


CN763566 


ID0AAA7BG 


c 


555 


73 


3.1 


980 


1 


AI256260 


AI256260 


ui96hl2.x 


c 


556 


72.8 


3.1 


517 


2 


AW604423 


AW604423 


RC3-CT025 


c 


557 


72.8 


3.1 


662 


7 


CK948186 


CK948186 


4072991 B 




558 


72.8 


3.1 


723 


4 


BG424452 


BG424452 


602447772 


c 


559 


72.6 


3.1 


565 


7 


CO328600 


CO328600 


EK291003. 




560 


72.6 


3.1 


741 


7 


CR559577 


CR559577 


DKFZp459B 




561 


72.4 


3.1 


584 


6 


CB586065 


CB586065 


AMGNNUC:N 




562 


72.4 


3.1 


935 


2 


BF234991 


- BF234991 


602027544 




563 


72.2 


3.1 


765 


7 


CK959288 


CK959288 


4100123 B 


c 


564 


72.2 


3.1 


781 


4 


BJ748442 


BJ748442 


BJ748442 




565 


72 


3.1 


361 


5 


BY082449 


BY082449 


BY082449 




566 


72 


3.1 


554 


5 


BP243787 


BP243787 


BP243787 




56,7.., 


72 


3.1 


560 


5 


BX485008 


.^.^ BX4 85008 


DKFZp686G 




568 


72 


3.1 


670 


7 


C0641121 


C0641121 


USDA-FP 1 




569 


72 


3.1 


775 


4 


BI144659 


BI144659 


602909950 




570 


71.8 


3.1 


935 


2 


BF532873 


BF532873 


602074982 


c 


571 


71.8 


3.1 


1028 


9 


CNS046W3 


AL277212 


Tetraodon 




572 


71.6 


3.1 


729 


5 


BQ179273 


BQ179273 


UI-M-EWO- 




573 


71.6 


3.1 


793 


7 


CF662633 


CF662633 


CcLL03all 




574 


71.4 


3.1 


690 


6 


CB467191 


CB467191 


732919 MA 


c 


575 


71 


3.1 


430 


4 


BI451798 


BI451798 


TY14E03 h 




576 


71 


3.1 


725 


7 


CK349802 


CK349802 


hggfhal8B 




577 


70.8 


3.1 


629 


4 


BM791817 


BM791817 


K-EST0071 


c 


578 


70.6 


3.0 


512 


1 


AI265232 


AI265232 


ui97d02.x 





579 


70.4 


3.0 


680 


5 


BX889196 




580 


70.4 


3.0 


871 


7 , 


. CN972726 




581 


70.4 


3.0 


944 


9 


CG770584 




582 


70 


3.0 


689 


4 


BM642521 




583 


70 


3.0 


733 


7 


CO561098 




584 


69.8 


3.0 


547 


7 


CR763995 




585 


69.8 


3.0 


688 


4 


BG286763 




586 


69.6 


3.0 


353 


5 


BY005597 


c 


587 


69.6 


3.0 


484 


1 


AI265260 




588 


69.6 


3.0 


698 


7 


CK450042 




589 


69.4 


3.0 


563 


4 


BM624521 




590 


69.4 


3.0 


722 


5 


BX879296 


c 


591 


69.4 


3.0 


1199 


6 


CD503770 




592 


69.2 


3.0 


789 


7 


C0558955 


c 


593 


69 


3.0 


315 


5 


BY356244 


c 


594 


69 


3.0 


517 


1 


AI118387 


c 


595 


68.8 


3.0 


498 


1 


AI118135 




596 


68.8 


3.0 


515 


2 


BB283598 


c 


597 


68.8 


3.0 


669 


4 


BG203058 




598 


68.6 


3.0 


441 


4 


BG876589 




599 


68.6 


3.0 


614 


4 


BM607789 




600 


68.6 


3.0 


837 


7 


CN976403 




601 


68.6 


3.0 


847 


4 


BG400151 




602 


68.6 


3.0 


855 


7 


CN977220 




603 


68.4 


2.9 


446 


1 


AL694265 




604 


68.2 


2.9 


428 


8 


AQ194654 


c 


605 


68.2 


2.9 


480 


1 


AI787349 


c 


606 


68.2 


2.9 


666 


7 


CK838901 




607 


68 


2.9 


445 


7 


CN750793 




608 


68 


2.9 


510 


1 


AI386060 




609 


67.8 


2.9 


512 


6 


CA380373 




610 


67.8 


2.9 


563 


7 


CK349927 




611 


67.6 


2.9 


470 


2 


BB747310 


c 


612 


67.6 


2.9 


672 


6 


CA331021 




613 


67.6 


2.9 


792 


7 


C0573111 


c 


614 


67.2 


2.9 


588 


6 


CB505187 




615 


67 


2.9 


400 


7 


CN976669 


c 


616 


67 


2.9 


692 


9 


AG078157 




617 


67 


2.9 


738 


7 


CK474315 


c 


618 


66.6 


2.9 


535 


7 


C0284467 




619 


66.6 


2.9 


616 


2 


AW701598 




620 


66.6 


2.9 


631 


4 


BJ004103 




621 


66.6 


2.9 


742 


6 


CB952913 




622 


66.6 


2.9 


821 


6 


CB200045 




623 


66.4 


2.9 


571 


1 


AU243299 




624 


66.2 


2.9... 


400 


7 


C0261693 




625 


66 


2.8 


611 


6 


CA968819 




626 


66 


2.8 


1447 


7 


CV200634 


c 


627 


65.6 


2.8 


618 


9 


FR0007644 




628 


65.6 


2.8 


901 


7 


CN501635 


c 


629 


65.4 


2.8 


344 


1 


AJ283854 


c 


630 


65.4 


2.8 


697 


2 


AW942728 




631 


65.4 


2.8 


805 


5 


BU124084 




632 


65.4 


2.8 


829 


2 


BF782818 




633 


65.2 


2.8 


632 


4 


BJ528642 




634 


65.2 


2.8 


762 


1 


AA717701 




635 


65.2 


2.8 


903 


7 


CN157171 



BX889196 BX889196 
CN972726 20223_124 
CG770584 TcB41.1_E 
BM642521 170006873 
CO561098 AGENCOURT 
CR763995 DKFZp469L 
BG286763 602382611 
BY005597 BY005597 
AI265260 ui97f08.x 
CK450042 893658 MA 
BM624521 170006874 
BX879296 BX879296 
CD503770 CDA64-C01 
C0558955 AGENCOURT 
BY356244 BY356244 
AI118387 ue36a02.x 
AI118135 ue34c01.x 
BB283598 BB283598 
BG203058 RST22427 
BG876589 QV1-DT006 
BM607789 170006870 
CN976403 26464_125 
BG400151 602440934 
CN977220 27419_125 
AL694265 AL694265 
AQ194654 RPCIll-48 
AI787349 uj66f05.x 
CK838901 UI-R-ACO- 
CN750793 ApDT-XXVI 
AI386060 inml5f04.y 
CA380373 659644 NC 
CK349927 hggfha20A 
BB747310 BB747310 
CA331021 hab31bl0. 
C0573111 AGENCOURT 
CB505187 ssalmge50 
CN976669 26779^125 
AG078157 Pan trogl 
CK474315 AGENCOURT 
C0284467 EK166132. 
AW701598 uq88bl0,y 
BJ004103 BJ004103 
CB952913 AGENCOURT 
CB2 00045 AGENCOURT 
AU243299 AU243299 
C0261693 41340.4.6 B 
CA968819 CcLL03a08 
CV200634 km06el0.y 
Z91454 F.rubripes 
CN501635 AGENCOURT 
AJ283854 4A3B-AAG- 
AW942728 LD25345.3 
BU124084 603147892 
BF782818 602107680 
BJ528642 BJ528642 
AA717701 vp94g02.r 
CN157171 945851 MA 



c 


636 


65 


2.8 


572 


7 


CK604711 




637 


65 


2.8 


939 


5 


BQ922613 




638 


65 


2.8 


1109 


6 


CD505365 




639 


64.8 


2.8 


196 


1 


AI317676 




640 


64.8 


2.8 


458 


7 


C0265158 




641 


64.8 


2.8 


650 


6 


CD596098 




642 


64.8 


2.8 


713 


6 


CA347838 


c 


643 


64.8 


2.8 


815 


1 


AA986709 




644 


64.6 


2.8 


759 


4 


BJ728374 




645 


64.4 


2.8 


492 


1 


AA493091 




646 


64.4 


2.8 


647 


1 


AA880242 




647 


64.2 


2.8 


854 


4 


BI147191 




648 


64 


2.8 


463 


1 


AA895843 




649 


64 


2.8 


469 


6 


CB147253 


c 


650 


64 


2.8 


489 


1 


AA985955 




651 


64 


2.8 


512 ' 


1 


AL785003 


c 


652 


64 


2.8 


547 


7 


. C0696892 




653 


63.8 


2.8 


721 


7 


CK663177 




654 


63.8 


2.8 


891 


4 


BG426822 




655 


63.8 


2.8 


966 


2 


BF027320 


c 


656 


63.6 


2.7 


705 


6 


CB165252 




657 


63.6 


2.7 


789 


7 


CN174688 




658 


63.4 


2.7 


1716 


6 


CD013997 




659 


. 63.2 


2.7 


364 


5 


BX872852 




660 


63.2 


2.7 


729 


7 


CO570100 




661 


63.2 


2.7 


950 


7 


C0582747 




662 


63 


2.7 


1267 


4 


BG210359 


c 


663 


62.8 


2.7 


556 


2 


BE026512 




664 


62.8 


2.7 


731 


7 


CV434524 




665 


62.6 


2.7 


346 


4 


BM030341 




666 


62.6 


2.7 


494 


8 


AQ620313 


c 


667 


62.6 


2.7 


569 


6 


CA342060 




668 


62.4 


2.7 


638 


4 


BM649075 




669 


62.4 


2.7 


807 


7 


CK030429 




670 


62.4 


2.7 


886 


2 


BF788242 




671 


62.4 


2.7 


980 


4 


BG179775 




672 


62 


2.7 


352 


9 


FR0049577 




673 


62 


2.7 


605 


9 


FR0007158 


c 


674 


62 


2.7 


624 


4 


BG302826 




675 


62 


2.7 


952 


5 


BU122907 




676 


61.8 


2.7 


630 


6 


CB577476 


c 


677 


61.8 


2.7 


631 


7 


CK132736 




678 


61.8 


2.7 


888 


6 


CD759765 


c 


679 


61.6 


2.7 


504 


7 


CO262072 




680 


61.6 


2.7 


613 


1 


AA647358 


c 


681 


61.6 


2.7 


619 


9 


FRO 007 662, 


c 


682 


61.4 


2.6 


483 


9 


CL583901 




683 


61.4 


2.6 


684 


6 


CD341703 




684 


61.4 


2.6 


852 


6 


CA976637 




685 


61.2 


2.6 


368 


5 


BU473251 


c 


686 


61.2 


2.6 


596 


9 


FR0049628 




687 


61.2 


2.6 


623 


7 


CK517693 




688 


61 


2.6 


415 


9 


FR0014950 




689 


61 


2.6 


739 


4 


BG195983 




690 


61 


2.6 


1071 


5 


BU525573 




691 


60.8 


2.6 


638 


4 


BI103514 


c 


692 


60.8 


2.6 


675 


4 


BJ136139 



CK604711 UT01394.3 
BQ922613 AGENCOURT , 
CD505365 CDA73-C09 
AI317676 uj22c05.y 
C0265158 EK141664. 
CD596098 RK100A2F1 
CA347838 679019 NC 
AA986709 uc80d04.x 
BJ728374 BJ728374 
AA493091 vi71h01.r 
AA880242 w98g03.r 
BI147191 602913291 
AA895843 vy34h02.r 
CB147253 K-EST0203 
AA985955 uc72h01.x 
AL785003 AL785003 
C0696892 DG32-1041 
CK663177 A296 Symb 
BG426822 602493003 
BF027320 601671692 
CB165252 ZDW602600 
CN174688 AGENCOURT 
CD013997 90117357 
BX872852 BX872852 
CO570100 AGENCOURT 
C0582747 ILLUMIGEN 
BG210359 RST29896 
BE026512 db27e05.x 
CV434524 CS_hyp_35 
BM030341 488958 MA 
AQ620313 HS_5180_B 
CA342060 671852 NC 
BM649075 170006873 
CK030429 AGENCOURT 
BF788242 602113469 
BG179775 602328862 
AL605386 Fugu rubr 
Z90968 F.rubripes 
BG302826 fl39e03.x 
BU122907 603147125 
CB577476 AMGNNUC:U 
CK132736 RE07815.3 
CD759765 AGENCOURT 
CO262072 EK084327. 
AA647358 vp88c04.r 
Z91472 F.rubripes 

CL583901 OB Ba007 

CD341703 StrPu536. 
CA976637 AGENCOURT 
BU473251 603760179 
AL605437 Fugu rubr 
CK517693 rswjbO_00 
AL006159 F.rubripe 
BG195983 RST15066 
BU525573 AGENCOURT 
BI103514 602889055 
BJ136139 BJ136139 



c 


693 


60.8 


2. 


6 


738 


6 


CB659466 




694 


60.8 


2. 


6 


806 


1 


AI048638 


c 


695 


60.8 


2. 


6 


829 


6 


CB666604 




696 


60.6 


2. 


6 


491 


4 


BG427032 


c 


697 


60.6 


2. 


6 


590 


4 


BJ142883 




698 


60.6 


2. 


6 


641 


7 


CF254125 


c 


699 


60.6 


2. 


6 


714 


9 


AG578491 


c 


700 


60.6 


2. 


6 


728 


4 


BJ139199 


c 


701 


60.6 


2. 


6 


732 


7 


CV516957 




702 


60.4 


2. 


6 


312 


9 


FR0015816 




703 


60.4 


2. 


6 


581 


1 


AA473690 


c 


704 


60.4 


2. 


6 


638 


7 


CK940315 


c 


705 


60.4 


2. 


6 


638 


7 


CK955845 


c 


706 


60.2 


2. 


6 


811 


8 


BZ267251 




707 


60 


2. 


6 


495 


1 


AA674349 




708 


60 


2. 


6 


784 


9 


CL311817 




709 


59.8 


2. 


6 


497 


1 


AJ683303 




710 


59.8 


2. 


6 


546 


1 


AA575452 




711 


59.8 


2. 


6 


552 


1 


AI286386 




712 


59.8 


2. 


6 


728 


7 


CO641072 




713 


59.6 


2. 


6 


379 


1 


AV683166 




714 


59.6 


2. 


6 


399 


7 


T62130 




715 


59.6 


2. 


6 


997 


9 


CNS005TE 


c 


716 


59.4 


2, 


6 


605 


7 


CK604610 


c 


717 


59.4 


2. 


6 


650 


9 


CL196913 




718 


59.4 


2. 


6 


681 


7 


CN790431 


c 


719 


59.4 


2. 


6 


815 


9 


CG231386 


c 


720 


59.4 


2. 


6 


835 


9 


CG281350 




721 


59 


2. 


5 


634 


4 


BG642104 


c 


722 


59 


2. 


5 


643 


7 


CK977010 


c 


723 


59 


2. 


5 


699 


8 


AZ947666 


c 


724 


59 


2. 


5 


717 


4 


BJ147817 




725 


59 


2. 


5 


853 


2 


BF232793 




726 


59 


2. 


5 


952 


2 


BF232705 




727 


58.8 


2. 


5 


522 


7 


CR771627 




728 


58.8 


2. 


5 


542 


4 


BJ520742 


c 


729 


58.6 


2. 


5 


82 


1 


AA974589 


c 


730 


58.6 


2. 


5 


481 


1 


AI668249 




731 


58.6 


2. 


5 


484 


5 


BQ666537 


c 


732 


58.6 


2. 


5 


493 


1 


AI664846 


c 


733 


58.6 


2. 


5 


522 


1 


AI759125 


c 


734 


58.6 


2. 


5 


528 


6 


CA401029 


c 


735 


58.6 


2. 


5 


569 


6 


CA399837 


c 


736 


58.6 


2. 


5 


572 


6 


CA404541 




737 


58.6 


2. 


5 


618 


2 


AW065890 


c 


738 


5.8 . 6 


2. 


5 


621 


9 


CG303576 


c 


739 


58.6 


2. 


5 


630 


6 


CA404340 


c 


740 


58.6 


2. 


5 


656 


6 


CD446879 


c 


741 


58.6 


2. 


5 


667 


6 


CA403669 


c 


742 


58.6 


2. 


5 


683 


4 


BJ816482 




743 


58.6 


2. 


5 


685 


9 


CG296824 


c 


744 


58.6 


2. 


5 


720 


6 


CA399209 


c 


745 


58.6 


2. 


5 


720 


6 


CA401228 


c 


746 


58.6 


2. 


5 


731 


6 


CA403005 


c 


747 


58.6 


2. 


5 


749 


3 


AY112099 


c 


748 


58.6 


2. 


5 


758 


6 


CB653386 


c 


749 


58.6 


2. 


5 


769 


6 


CD446561 



CB659466 0SJNEcl6H 
AI048638 ud64c07.y 
CB666604 0SJNEdl3J 
BG427032 602493934 
BJ142883 BJ142883 
CF254125 mdvnll2_a 
AG578491 Mus muscu 
BJ139199 BJ139199 
CV516957 0048P0018 
AL007022 F.rubripe 
AA473690 vg89c08.r 
CK940315 4113683 B 
CK955845 4095851 B 
BZ267251 CH230-262 
AA674349 vp98h05.r 
CL311817 286867_LB 
AJ683303 AJ683303 
AA575452 vl90b05.r 
AI286386 ui75g08.y 
CO641072 USDA-FP_1 
AV683166 AV683166 
T62130 yc66ell.rl 
AL060767 Drosophil 
CK604610 UT01321.3 
CL196913 104_423_1 
CN790431 4125029 B 
CG231386 OGWGN83TH 
CG281350 bGYAI36TV 
BG642104 pgllc.pkO 
CK977010 4108302 B 
AZ947666 2M0210N24 
BJ147817 BJ147817 
BF232793 602023393 
BF232705 602023293 
CR771627 DKFZp469F 
BJ520742 BJ520742 
AA974589 op28hl0.s 
AI668249 605018G07 
BQ666537 pb43bl2.y 
AI664846 605002D07 
AI759125 605085D09 
CA401029 EL01N0413 
CA399837 EL01N0326 
CA404541 EL01N0519 
AW065890 687002E04 
CG30.3576 OG0BG09TH 
CA404340 EL01N0515 
CD446879 EL01T0208 
CA403669 EL01N0505 
BJ816482 BJ816482 
CG296824 OGWKC02TV 
CA399209 EL01N0315 
CA401228 EL01N0416 
CA403005 EL01N0445 
AY112099 Zea mays 
CB653386 OSJNEc04F 
CD446561 EL01T0203 



c 


750 


58.6 


2. 


5 


785 


6 


CD446551 


CD446551 


EL01T0203 


c 


751 


58.6 


2. 


5 


787 


8 


BZ637786 


BZ637786 


OGCBL09TM 


c 


752 


58.6 


2. 


5 


799 


9 


CC724533 


CC724533 


OGWIZ79TV 


c 


753 


58.6 


2. 


5 


820 


6 


CD437268 


CD437268 


EL01N0371 




754 


58.6 


2. 


5 


829 


9 


CG083392 


CG083392 


PUILH08TD 


c 


755 


58.6 


2. 


5 


876 


6 


CD441044 


CD441044 


EL01N0551 




,756 


58.4 


2. 


5 


942 


7 


CN990976 


CN990976 


67253 125 


c 


757 


58.2 


2. 


5 


704 


1 


AU213104 


AU213104 


AU213104 




758 


58.2 


2. 


5 


1465 


6 


CD013999 


CD013999 


90117633 




759 


58 


2. 


5 


523 


1 


AA511527 


. AA511527 


vj28d05.r 




760 


58 


2. 


5 


566 


5 


BP221553 


BP221553 


BP221553 


c 


761 


58 


2. 


5 


842 


7 


C0161727 


C0161727 


FLDl 30 F 


c 


762 


58 


2. 


5 


851 


7 


CO161650 


CO161650 


FLD1_30_F 


c 


763 


57.8 


2. 


5 


461 


1 


AI256492 


AI256492 


ui87a02.x 


c 


764 


57.6 


2. 


5 


355 


9 


CC465888 


CC465888 


ZMMBBc039 




765 


57.6 


2. 


5 


391 


4 


BG186196 


BG186196 


RST5044 A 


c 


766 


57.6 


2. 


5 


578 


9 


CG301141 


CG301141 


0GXCC14TH 




767 


57.6 


2. 


5 


686 


6 


CD444925 


CD444925 


EL01N0445 




768 


57.6 


2. 


5 


823 


4 


BI332351 


BI332351 


602983540 




769 


57.4 


2. 


5 


628 


1 


AV657530 


AV657530 


AV657530 


c 


770 


57.2 


2. 


5 


346 


8 


BZ348795 


BZ348795 


hp61d04.g 


c 


771 


57.2 


2. 


5 


538 


1 


AL915629 


AL915629 


AL915629 




772 


57.2 


2. 


5 


558 


1 


AU201532 


AU201532 


AU201532 




773 


57.2 


2. 


5 


624 


2 


BE238993 


BE238993 


MD0649 Me 




774 


57.2 


2. 


5 


629 


7 


CK498980 


CK498980 


rswbbO 00 




775 


57 


2. 


5 


437 


7 


CF919695 


CF919695 


Bflor531. 


c 


776 


57 


2. 


5 


447 


6 


CD441431 


CD441431 


EL01N0556 


c 


777 


57 


2. 


5 


700 


4 


BJ148891 


BJ148891 


BJ148891 




778 


57 


2. 


5 


1101 


9 


CNSOOKFD 


AL077908 


Drosophil 




779 


56.8 


2. 


4 


440 


2 


BB745467 


•BB745467 


BB745467 




780 


56.8 


2. 


4 


445 


7 


CN955753 


CN955753 


3252_104- 




781 


56.8 


2. 


4 


601 


7 


CO637012 


CO637012 


USDA-FP_1 




782 


56.8 


2. 


4 


674 


8 


BZ328006 


BZ328006 


id33gl0.g 


c 


783 


56.8 


2. 


4 


807 


8 


BZ220205 


BZ220205 


CH230-360 




784 


56.8 


2. 


4 


837 


9 


CC659620 


CC659620 


OGXAA96TV 


c 


785 


56.8 


2. 


4 


880 


9 


CC616166 


CC616166 


0GLCB41TH 




786 


56.8 


2. 


4 


1024 


3 


AY107409 


AY107409 


Zea mays 


c 


787 


56.6 


2. 


4 


583 


1 


AI876537 


AI876537 


uj59fll.x 




788 


56.6 


2. 


4 


624 


8 


CC386454 


CC386454 


PUHPE4 9TD 


c 


789 


56.6 


2. 


4 


628 


8 


CC386449 


CC386449 


PUHPE49TB 




790 


56.6 


2. 


4 


872 


7 


CN582253 


CN582253 


USDA-FP_1 




791 


56.4 


2. 


4 


632 


6 


CD748367 


CD748367 


rw36d05.y 


c 


792 


56.4 


2. 


4 


732 


4 


BJ816728 


BJ816728 


BJ816728 


c 


793 


56.4 


2. 


4 


743 


4 


BJ816263 


BJ816263 


BJ816263 


c 


794 


56.4 


2. 


4 


749 


4 


BJ152595 


BJ152595 


BJ152595 


c 


795 


56.4 


2. 


4 


7m 


4 


BJ779102 


BJ779102 


BJ779102 


c 


796 


56.4 


2. 


4 


775 


4 


BJ149607 


BJ149607 


BJ149607 


c 


797 


56.4 


2. 


4 


111 


4 


BJ129204 


BJ129204 


BJ129204 




798 


56.4 


2. 


4 


814 


4 


BG428781 


BG428781 


602500860 


c 


799 


56.2 


.2. 


4 


426 


7 


CO308066 


CO308066 


EK256548. 


c 


800 


56.2 


2. 


4 


629 


6 


CB494209 


CB494209 


omykrbhbO 


c 


801 


56.2 


2. 


4 


793 


4 


BJ149397 


BJ149397 


BJ149397 




802 


56.2 


2. 


4 


809 


4 


BG971667 


BG971667 


602840766 




803 


56.2 


2. 


4 


1101 


9 


CNSOOOWU 


AL053681 


Drosophil 




804 


56 


2. 


4 


436 


7 


CO316063 


CO316063 


EK263564. 




805 


56 


2. 


4 


488 


3 


AY432118 


AY432118 


Aedes aeg 


c 


806 


56 


2. 


4 


504 


9 


CG801172 


CG801172 


1118018D0 





807 


56 


2.4 


568 


5 


BP213630 


BP213630 


BP213630 


c 


808... 


56 


2.4 


695 


4 


BJ776790 


BJ776790 


BJ776790 


c 


809 


56 


2.4 


785 


9 


CC659613 


CC659613 


OGXAA96TH 




810 


56 


2.4 


900 


5 


BU123073 


BU123073 


603149488 




811 


55.8 


2.4 


221 


2 


BB561857 


BB561857 


BB561857 




812 


55.8 


2.4 


316 


7 


CK942201 


CK942201 


4065834 B 


c 


813 


55.8 


2.4 


619 


9 


FR0015790 


AL006996 


F. rubripe 


c 


814 


55.8 


2.4 


704 


1 


AI529566 


AI529566 


ui82b06.x 




815 


55.8 


2.4 


1012 


9 


CNS0050V 


AL056586 


Drosophil 




816 


55.8 


2.4 


1266 


9 


CL962544 


CL962544 


OSIFCC007 




817 


55.6 


2.4 


353 


2 


AW424655 


AW424655 


707020H03 


c 


818 


55.6 


2.4 


454 


4 


BJ817721 


BJ817721 


BJ817721 




819 


55.6 


2.4 


521 


4 


BM130311 


BM130311 


pb29dl2.y 




820 


55.6 


2.4 


525 


2 


BE138067 


BE138067 


ug48a07.y 




821 


55.6 


2.4 


583 


5 


BX765624 


BX765624 


BX765624 




822 


55.6 


2.4 


633 


5 


BQ588578 


BQ588578 


E012562-0 


c 


823 


55.6 


2,4 


716 


1 


AU216588 


AU216588 


AU216588 




824 


55.6 


2.4 


753 


4 


BI143423 


BI143423 


602907790 




825 


55.6 


2.4 


813 


6 


CA480775 


CA480775 


AGENCOURT 




826 


55.6 


2.4 


984 


9 


CNS006LF 


AL065715 


Drosophil 




827 


55.6 


2.4 


1009 


9 


CNS007PY 


AL067477 


Drosophil 


c 


828 


55.4 


2.4 


309 


9 


CL583240 


CL583240 


OB Ba004 




829 


55.4. 


.2.4 


548 


7 


CK886207 


CK886207 


SGP166618 


c 


830 


55.4 


2.4 


675 


4 


BG320141 


BG320141 


Zm03 05c0 


c 


831 


55.2 


2.4 


592 


7 


CO184970 


CO184970 


EC29003.3 




832 


55.2 


2.4 


713 


7 


C0638381 


C0638381 


USDA-FP 1 




833 


55.2 


2.4 


845 


6 


CA481394 


CA481394 


AGENCOURT 




834 


54.8 


2.4 


507 


6 


CB164110 


CB164110 


K-EST0225 




835 


54.8 


2.4 


570 


5 


BP257713 


BP257713 


BP257713 


c 


836 


54.8 


2.4 


573 


9 


FR0049536 


AL605345 


Fugu rubr 


c 


837 


54.8 


2.4 


619 


9 


FR0015778 


AL006985 


F. rubripe 


c 


838 


54.8 


2.4 


734 


4 


BJ794516 


BJ794516 


BJ794516 




839 


54.6 


2.4 


584 


1 


AI987137 


AI987137 


rs22b08.y 




840 


54.6 


2.4 


603 


5 


BX499005 


BX499005 


DKFZp779A 




841 


54.6 


2.4 


619 


6 


CA100959 


CA100959 


SCCCFLIOO 




842 


54.6 


2.4 


644 


5 


BQ578506 


BQ578506 


WHE0304 F 




843 


54.6 


2.4 


739 


4 


BG969762 


BG969762 


602837258 




844 


54.4 


2.3 


464 


6 


CB953456 


CB953456 


AGENCOURT 




845 


54.4 


2.3 


542 


8 


BH778649 


BH778649 


fzmb013f0 


c 


846 


54.4 


2.3 


604 


5 


BU0509.66 


BU050966 


1111036G0 




847 


54.4 


2.3 


661 


7 


CF623828 


CF623828 


znu:ws05_0 




848 


54.4 


2.3 


662 


7 


CK827797 


CK827797 


zmrws05 0 




849 


54.4 


2.3 


737 


9 


CG370415 


CG370415 


OGWIZ79TH 




850 


54.2 


2.3 


372 


1 


AA675719 


AA675719 


MBAFCW4A0 


c 


851 


54.2 


2.3 


574 


4 


BJ538530 


BJ538530 


BJ538530 




852 


54.2 


2.3 


576 


4 


BJ525957 


BJ525957 


BJ525957 




853 


54.2 


2.3 


628 


2 


BF787599 


BF787599 


602114528 




854 


54.2 


2.3 


1194 


7 


CV186196 


CV186196 


A-013 D. 




855 


54 


2.3 


427 


7 


C0285751 


C0285751 


EK060617. 




856 


54 


2.3 


463 


6 


CD915387 


CD915387 


G550.125P 




857 


54 


2.3 


580 


5 


BP215685 


BP215685 


BP215685 


c 


858 


54 


2.3 


916 


7 


CV280250 


CV280250 


WS0135.B2 


c 


859 


53.8 


2.3 


527 


6 


CA405139 


CA405139 


EL01N0529 




860 


53.8 


2.3 


593 


4 


BM143646 


BM143646 


saj47a09. 


c 


861 


53.8 


2.3 


825 


7 


C0981254 


C0981254 


GM89010B2 


c 


862 


53.6 


2.3 


446 


2 


AW113351 


AW113351 


MC798 mou 


c 


863 


53.6 


2.3 


635 


1 


AU219747 


AU219747 


AU219747 



c 


864 


53.6 


2.3 


816 


7 


CR530976 


CR530976 CR530976 


c 


865 


53.6 


2.3. 


836 


1 


AI529553 


AI529553 ui82a02.x 




866 


53.6 


2.3 


888 


6 


CA975538 


CA975538 AGENCOURT 




867 


53.6 


2.3 


954 


7 


CO580598 


CO580598 ILLUMIGEN 


c 


868 


53.6 


2.3 


1014 


9 


CNS05PJQ 


AL348047 Tetraodon 


c 


869 


53.4 


2.3 


253 


2 


AW531318 


AW531318 UI-R-C4-a 


c 


870 


53.4 


2.3 


360 


9 


FR0015823 


AL007029 F.rubripe 




871 


53.4 


2.3 


574 


7 


CN133765 


CN133765 OXl_18_E0 




872 


53.4 


2.3 


629 


6 


CB927938 


CB927938 ABAl 35 B 




873 


53.4 


2.3 


687 


1 


AU055875 


AU055875 AU055875 




874 


53.4 


2.3 


733 


9 


CR501049 


CR501049 Medicago 




875 


53.4 


2.3 


736 


9 


CR500903 


CR500903 Medicago 


c 


876 


53.4 


2.3 


784 


7 


CK265007 


CK265007 EST711085 




877 


53.4 


2.3 


820 


4 


BG577461 


BG577461 R77 MPIZ- 




878 


53.4 


2.3 


822 


9 


CR500725 


CR500725 Medicago 


c 


879 


53.4 


2.3 


875 


9 


CC692091 


CC692091 OGLBU37TH 




880 


53.4 


2.3 


875 


9 


CC692097 


CC692097 OGLBU37TV 




881 


53.4 


2.3 


945 


4 


BG167885 


BG167885 602339857 


c 


882 


53.2 


2.3 


552 


2 


AW351201 


AW351201 GM210011A 


c 


883 


53.2 


2.3 


552 


6 


CB399450 


CB399450 OSTR221F2 


c 


884 


53.2 


2.3 


552 


6 


CB399482 


CB399482 OSTR221F2 


c 


885 


53.2 


2.3 


700 


2 


BE190894 


BE190894 sn80c04.y 


c 


886 


53.2 


2.3 


738 


4 


BJ775701 


BJ775701 BJ775701 




887 


53.2 


2.3 


743 


2 


BF236932 


BF236932 602026557 


c 


888 


53.2 


2.3 


769 


4 


BJ791007 


BJ791007 BJ791007 




889 


53.2 


2.3 


873 


7 


CN167653 


CN167653 AGENCOURT 




890 


53.2 


2.3 


914 


7 


C0645579 


C0645579 ILLUMIGEN 


c 


891 


53 


2.3 


467 


1 


AI665286 


AI665286 605009C03 




892 


53 


2.3 


602 


6 


CA248078 


CA248078 SCCCAM109 




893 


53 


2.3 


603 


9 


FR0007962 


Z91772 F.rubripes 




894 


53 


2.3 


614 


6 


CA242675 


CA242675 SCSFFL309 


c 


895 


53 


2.3 


925 


7 


CN761277 


CN761277 ID0AAA2DE 




896 


52.8 


2.3 


447 


5 


BU039808 


BU039808 PP LEaOOO 




897 


52.8 


2.3 


499 


4 


BI610180 


BI610180 RH15147.5 




898 


52.8 


2.3 


514 


5 


BM952830 


BM952830 952059E04 




899 


52.8 


2.3 


652 


8 


AZ945505 


AZ945505 2M0206H23 


c 


900 


52.8 


2.3 


672 


4 


BJ135701 


BJ135701 BJ135701 


c 


901 


52.8 


2.3 


755 


4 


BJ809035 


BJ809035 BJ809035 


c 


902 


52.6 


2.3. 


573 


8 


AZ831866 


AZ831866 2M0111C19 




903 


52.6 


2.3 


619 


9 


FR0007177 


Z90987 F.rubripes 




904 


52.6 


2.3 


992 


9 


CNS038FN 


AL232556 Tetraodon 


c 


905 


52.4 


2.3 


464 


9 


CC693642 


CC693642 OGUGK52TH 




906 


52.4 


2.3 


504 


6 


CA7 00350 


CA700350 wkmlc.pkO 




907 


52.4 


2.3 


519 


6 


CA627167 


CA627167 wlln.pkl5 


c 


908 


52.4 


2.3 


528 


7 


CV051393 


CV051393 EST 10848 




909 ' 


5,2 . 4 


2.3 


544 


6 


CA621946 


CA62a946 wlln.pkOO 




910 


52.4 


2.3 


545 


5 


BQ483975 


BQ483975 WHE3515__A 


c 


911 


52.4 


2.3 


551 


9 


CG233803 


CG233803 OGVCV89TH 




912 


52.4 


2.3 


573 


7 


CV053322 


CV053322 EST 12770 




913 


52.4 


2.3 


593 


7 


CV053078 


CV053078 EST 12526 




914 


52.4 


2.3 


619 


9 


FR0007182 


Z90992 F.rubripes 


c 


915 


52.4 


2.3 


623 


9 


CNS03MFA 


AL250687 Tetraodon 


c 


916 


52.4 


2.3 


638 


7 


CV051636 


CV051636 EST 11091 


c 


917 


52.4 


2.3 


666 


1 


AI158921 


AI158921 uf05c02.x 


c 


918 


52.2 


2.3 


443 


8 


CC029515 


CC029515 3591_1 11 


c 


919 


52.2 


2.3 


658 


7 


CO210550 


CO210550 WS00918.B 




920 


52 


2.2 


272 


9 


CL175246 


CL175246 104 380 1 





921 


52 


2.2 


343 


4 


BG182337 


BG182337 


RST1203 A 




922 


52 


2.2 


536 


7 


CF349411 . 


CF349411 


fc2486.e 




923 


52 


2.2 


582 


5 


BP217111 


BP217111 


BP217111 




924 


52 


2.2 


631 


6 


CA111498 


CA111498 


SCAGLB107 


c 


925 


52 


2.2 


703 


5 


BQ475642 


BQ475642 


cicindela 


c 


926 


52 


2.2 


721 


5 


BQ262583 


BQ262583 


fz84dl2.x 




927 


52 


2.2 


803 


1 


AJ006537 


AJ006537 


AJ006537 


c 


928 


51.8 


2.2 


496 


1 


AI855338 


AI855338 


603014F06 




929 


51.8 


2.2 


508 


1 


AI614365 


AI614365 


vi71h01.y 


c 


930 


51*8 


2.2 


511 


7 


C0528524 


C0528524 


3530 1 18 




931 


51.8 


2.2 


543 


6 


CB339758 


CB339758 


CA23EI03I 


c 


932 


51.8 


2.2 


574 


6 


CB339351 


CB339351 


CA23EI03I 


c 


933 


51.8 


2.2 


,610 


4 


BI542629 


BI542629 


949020C07 


c 


934 


51.8 


2.2 


616 


7 


C0517693 


C0517693 


3530__1_11 


c 


935 


51.8 


2.2 


659 


7 


CF630818 


CF630818 


zitirws 48 0 




936 


51.8 


2.2 


687 


6 


CB250778 


CB250778 


75 Aedes 


c 


937 


51.8 


2.2 


690 


7 


CF243936 


CF243936 


3530 1 25 




938 


51.8 


2.2 


695 


7 


CK962027 


CK962027 


4076363 B 




939 


51.8 


2.2 


724 


9 


CL154714 


CL154714 


104_340_1 




940 


51.8 


2.2 


748 


6 


CB954704 


CB954704 


AGENCOURT 


c 


941 


51.8 


2.2 


768 


6 


CB631476 


CB631476 


OSIIEb09A 




942 


51.8 


2.2 


798 


5 


BM948419 


BM948419 


UI-M-EGOp 


c 


943 


51.8 ■ 


2.2 


822 


7 


CF633893 


CF633893 


zmrwwOO 0 




944 


51.8 


2.2 


827 


7 


CK370475 


CK370475 


zmrwwOOS 




945 


51.8 


2.2 


840 


6 


CB631477 


CB631477 


OSIIEb09A 


c 


946 


51.8 


2.2 


892 


4 


BG208263 


BG208263 


RST27755 




947 


51.8 


2.2 


893 


6 


CA076965 


CA076965 


SCQGAM104 




948 


51.8 


2.2 


949 


3 


AY105440 


AY105440 


Zea mays 


c 


949 


51.8 


2.2 


1049 


2 


BE573200 


BE573200 


601332887 




950 


51.8 


.2.2 


1419 


9 


CL965868 


CL965868 


OSIFCC012 


c 


951 


51.6 


2.2 


335 


8 


CC040482 


CC040482 


3591 1 13 


c 


952 


51.6 


2.2 


440 


8 


CC040369 


CC040369 


3591 1 13 




953 


51.6 


2.2 


442 


7 


CN956527 


CN956527 


4027 5001 


c 


954 


51.6 


2.2 


454 


8 


CC040451 


CC040451 


3591 1 13 


c 


955 


51.6 


2.2 


461 


8 


CC042725 


CC042725 


3591_1_15 


c 


956 


51.6 


2.2 


545 


6 


CB505278 


CB505278 


ssalnigd50 


c 


957 


51.6 


2.2 


572 


2 


AW471171 


AW471171 


xu08h02.x 




958 


51.6 


2.2 


633 


2 


BF028687 


BF028687 


601763929 


c 


959 


51.6 


2.2 


667 


7 


C0519314 


C0519314 


3530 1 12 


c 


960 


51.6 


2.2 


671 


4 


BJ146707 


BJ146707 


BJ146707 




961 


51.6 


2.2 


728 


4 


BIl 02440 


BI 102440 


602889631 


c 


962 


51.6 


2.2 


729 


9 


CG436794 


CG436794 


OGVGU72TH 


c 


963 


51.6 


2.2 


111 


9 


CG267261 


CG267261 


0GWHF51TH 




964 


51.6 


2.2 


781 


6 


CB683695 


CB683695 


OSJNEf 12G 


c 


965 


51.6 


2.2 


834 


7 


CF634127 


CF634127 


zmrwwO 0 0 




966 


51.6 


2.2 


854 


1 


.-.AL931420 


AL931420 


AL931420 


c 


967 


51.6 


2.2 


899 


9 


CC618081 


CC618081 


OGLAX62TH 




968 


51.6 


2.2 


1031 


6 


CA139432 


CA139432 


SCEQRT209 




969 


51.4 


2.2 


494 


9 


CL572955 


CL572955 


OB BaOOl 




970 


51.4 


2.2 


568 


6 


CD433794 


CD433794 


EL01N0315 


c 


971 


51.4 


2.2 


649 


8 


BZ726909 


BZ726909 


0GECF17TC 




972 


51.4 


2.2 


654 


7 


CN150564 


CN150564 


WOUND1_70 




973 


51.4 


2.2 


666 


5 


BQ753201 


BQ753201 


WHE4124 C 


c 


974 


51.4 


2.2 


673 


4 


BJ783526 


BJ783526 


BJ783526 




975 


51.4 


2.2 


673 


9 


CL537275 


CL537275 


OB Ba004 




976 


51.4 


2.2 


679 


6 


CA073196 


CA073196 


SCEPAM105 


c 


977 


51.4 


2.2 


690 


1 


AU216711 


AU216711 


AU216711 



c 978 


51.4 


2 


.2 


702 


4 


BJ788990 


BJ788990 


BJ788990 


979 


51.4 


2 


.2 


707 


8 


BZ726916 


BZ726916 


0GECF17TM 


c 980 


51.4 


2 


.2 


708 


4 


BJ133121 


BJ133121 


BJ133121 


c 981 


51.4 


2 


.2 


709 


4 


BJ818107 


BJ818107 


BJ818107 


c 982 


51.4 


2 


.2 


763 


6 


CA135704 


CA135704 


SCJLRT102 


c 983 


51.4 


2 


.2 


799 


6 


CD871509 


CD871509 


AZ02.118G 


984 


51.4 


2 


.2 


919 


9 


CG445266 


CG445266 


OGTBF39TV 


985 


51.4 


2 


.2 


1127 


7 


CK028276 


CK028276 


AGENCOURT 


986 


51.2 


2 


.2 


532 


9 


CC733695 


CC733695 


0GUC018TH 


987 


51.2 


2 


.2 


539 


6 


CD750145 


CD750145 


rv07c03.y 


c 988 


51.2 


2 


.2 


562 


9 


CC733706 


CC733706 


0GUC018TV 


989 


51.2 


2 


.2 


565 


5 


BX676981 


BX676981 


BX676981 


990 


51.2 


2 


.2 


582 


7 


C0997967 


C0997967 


pam01-15m 


991 


51.2 


2 


.2 


707 


7 


CO070063 


CO070063 


GR Ea260 


c 992 


51.2 


2 


.2 


784 


7 


CO070062 


CO070062 


GR Ea260 


c 993 


51.2 


2 


.2 


790 


7 


CO084607 


CO084607 


GR EaOlF 


994 


51.2 


2 


.2 


793 


9 


CG205456 


CG205456 


PUIAU4 4TD 


c 995 


51.2 


2 


.2 


1031 


9 


CC730025 


CC730025 


OGUDN62TV 


996 


51.2 


2 


.2 


1458 


9 


CL982528 


CL982528 


OSIFSC048 


997 


51 


2 


.2 


430 


5 


BP940165 


BP940165 


BP940165 


998 


51 


2 


.2 


586 


9 


FR0007952 


Z91762 F. 


. rubripes 


c 999 


51 


2 


.2 


596 


5 


BP036706 


BP036706 


BP036706 


clOOO 


51 


2 


.2 


608 


6 


CB637106 


CB637106 


OSJNEa03E 


1001 


51 


2 


.2 


717 


1 


AF527714 


AF527714 


AF527714 


C1002 


51 


2 


.2 


750 


6 


CB646078 


CB646078 


OSJNEb08E 


C1003 


51 


2 


.2 


768 


6 


CB655823 


CB655823 


OSJNEC09J 


C1004 


51 


2 


.2 


769 


6 


CB680689 


CB680689 


OSJNEf05N 


C1005 


51 


2 


.2 


807 


6 


CB679495 


CB679495 


OSJNEf 03B 


C1006 


51 


2 


.2 


809 


6 


'CB679818 


CB679818 


OSJNEf03M 


1007 


51 


2 


.2 


810 


6 


CB680688 


CB680688 


OSJNEf05N 


C1008 


51 


2 


.2 


812 


6 


CB671618 


CB671618 


OSJNEe05E 


1009 


51 


2 


.2 


821 


6 


CB655822 


CB655822 


OSJNEC09J 


clOlO 


51 


2 


.2 


829 


6 


CB666716 


CB666716 


0SJNEdl3L 


1011 


50.8 


2 


.2 


265 


7 


CV052826 


CV052826 


EST 12274 


C1012 


50.8 


2 


.2 


494 


7 


CV047765 


CV047765 


EST 13082 


1013 


50.8 


2 


.2 


551 


6 


CA041628 


CA041628 


ssalplnb5 


C1014 


50.8 


2 


.2 


578 


7 


C0698343 


C0698343 


DG32-134j 


1015 


50.8 


2 


.2 


580 


5 


BP276996 


BP276996 


BP276996 


1016 


50.8 


2 


.2 


582 


5 


BP270998 


BP270998 


BP270998 


1017 


50.8 


2 


.2 


609 


6 


CD224613 


CD224613 


CCC1_35_F 


C1018 


50.8 


2 


.2 


647 


4 


BM269408 


BM269408 


MEST409-F 


C1019 


50.8 
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AY404343 1182 bp DNA linear GSS 12-DEC-2003 

Homo sapiens HCM1859 gene, VIRTUAL TRANSCRIPT, partial sequence, 
genomic survey sequence. 
AY404343 

AY404343 . 1 GI : 39760320 
GSS. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 1182) 

Clark, A. G., Glanowski, S . , Nielson,R., Thomas, P., Kejariwal, A. , 
Todd, M. A., Tanenbaum, D.M. , Civello, D. R. , Lu,F., Murphy, B., 
Ferriera,S., Wang>G., Zheng, X.H,, White, T. J., Sninsky, J. J. , 
Adams, M.D. and Cargill,M. 

Inferring nonneutral evolution from human- chimp-mouse orthologous 
gene trios 

Science 302 (5652), 1960-1963 (2003) 
14671302 

2 (bases 1 to 1182) 

Clark,A.G., Glanowski, S . , Nielson,R., Thomas, P., Kejariwal, A. , 
Todd, M. A., Tanenbaum, D.M. , Civello, D. R. , Lu,F., Murphy, B., 
Ferriera,S., Wang,G., Zheng, X.H., White, T. J., Sninsky, J. J. , 
Adams, M.D. and Cargill,M. 
Direct Submission 

Submitted ( 16-NOV-2003) Celera Genomics, 45 West Gude Drive, 
Rockville, MD 20850, USA 

This sequence was made by sequencing genomic exons and ordering 
them based on alignment. ^ 

Location/Qualifiers 

1. .1182 

/organism="Homo sapiens" 
/mol_type=" genomic DNA" 
/db_xref="taxon:9606" 
<1. .>1182 

/locus tag="HCM1859" 



Query Match 50.9%; 
Best Local Similarity 99.9%; 



Score 1180.4; DB 9; Length 1182; 
Pred. No. 0; 



Matches 1181; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy 458 ATGGATTCCTTA7\AGAATGAGAACTTCGACATGGTGATAGTTGAAACTTTTGACTACTGT 517 

I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I 
Db 1 ATGGATTCCTTAAAGAATGAGAACTTCGACATGGTGATAGTTGAAACTTTTGACTACTGT 60 

Qy 518 CCTTTCCTGATTGCTGAGAAGCTTGGGAAGCCATTTGTGGCCATTCTTTCCACTTCATTC 577 

I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 CCTTTCCTGATTGCTGAGAAGCTTGGGAAGCCATTTGTGGCCATTCTTTCCACTTCATTC 120 

Qy 578 GGCTCTTTGGAATTTGGGCTACCAATCCCCTTGTCTTATGTTCCAGTATTCCGTTCCTTG 637 

I I M I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 GGCTCTTTGGAATTTGGGCTACCAATCCCCTTGTCTTATGTTCCAGTATTCCGTTCCTTG 180 

Qy 638 CTGACTGATCACATGGACTTCTGGGGCCGAGTGAAGAATTTTCTGATGTTCTTTAGTTTC 697 

I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 CTGACTGATCACATGGACTTCTGGGGCCGAGTGAAGAATTTTCTGATGTTCTTTAGTTTC 240 

Qy 698 TGCAGGAGGCAACAGCACATGCAGTCTACATTTGACAACACCATCAAGGAACATTTCACA 757 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I 
Db 241 TGCAGGAGGCAACAGCACATGCAGTCTACATTTGACAACACCATCAAGGAACATTTCACA 300 

Qy 758 GAAGGCTCTAGGCCAGTTTTGTCTCATCTTCTACTGAAAGCAGAGTTGTGGTTCATTAAC 817 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 GAAGGCTCTAGGCCAGTTTTGTCTCATCTTCTACTGAAAGCAGAGTTGTGGTTCATTAAC 360 

Qy 818 TCTGACTTTGCCTTTGATTTTGCTCGACCTCTGCTTCCCAACACTGTTTATGTTGGAGGC 877 

I I I I. I M I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 361 TCTGACTTTGCCTTTGATTTTGCTCGACCTCTGCTTCCCAACACTGTTTATGTTGGAGGC 420 

Qy 878 TTGATGGAAAAACCTATTAAACCAGTACCACAAGACTTGGAGAACTTCATTGCCAAGTTT 937 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I 
Db 421 TTGATGGAAAAACCTATTAAACCAGTACCACAAGACTTGGAGAACTTCATTGCCAAGTTT 480 

Qy 938 GGGGACTCTGGTTTTGTCCTTGTGACCTTGGGCTCCATGGTGAACACCTGTCAGAATCCG 997 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I 

Db 481 GGGGACTCTGGTTTTGTCCTTGTGACCTTGGGCTCCATGGTGAACACCTGTCAGAATCCG 540 

Qy ■ 998 GAAATCTTCAAGGAGATGAACAATGCCTTTGCTCACCTACCCCAAGGGGTGATATGGAAG 1057 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 541 GAAATCTTCAAGGAGATGAACAATGCCTTTGCTCACCTACCCCAAGGGGTGATATGGAAG 600 

Qy 1058 TGTCAGTGTTCTCATTGGCCCAAAGATGTCCACCTGGCTGCAAATGTGAAAATTGTGGAC 1117 

I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I 

Db 601 TGTCAGTGTTCTCATTGGCCCAAAGATGTCCACCTGGCTGCAAATGTGAAAATTGTGGAC 660 

Qy 1118 TGGCTTCCTCAGAGTGACCTCCTGGCTCACCCAAGCATCCGTCTGTTTGTCACCCACGGC 1177 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 661 TGGCTTCCTCAGAGTGACCTCCTGGCTCACCCAAGCATCCGTCTGTTTGTCACCCACGGC 720 

Qy 1178 GGGCAGAATAGCATAATGGAGGCCATCCAGCATGGTGTGCCCATGGTGGGGATCCCTCTC 1237 

I I I I I I I I M I I I I I I I I I I I I I M I I I I M I I I I I I I I I I I I I I I I I I I I I M I I I I I I 
Db 721 GGGCAGAATAGCATAATGGAGGCCATCCAGCATGGTGTGCCCATGGTGGGGATCCCTCTC 780 



Qy 

Db 



1238 TTTGGAGACCAGCCTGAAAACATGGTCCGAGTAGAAGCCAAAAAGTTTGGTGTTTCTATT 1297 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I 
781 TTTGGAGACCAGCCTGAAAACATGGTCCGAGTAGAAGCCAAAAAGTTTGGTGTTTCTATT 840 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



1298 CAGTTAAAGAAGCTCAAGGCAGAGACATTGGCTCTTAAGATGAAACAAATCATGGAAGAC 1357 

1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 n I I I I 

841 CAGTTAAAGAAGCTCAAGGCAGAGACATTGGCTCTTAAGATGAAACAAATCATGGAAGAC 900 

1358 AAGAGATACAAGTCCGCGGCAGTGGCTGCCAGTGTCATCCTGCGCTCCCACCCGCTCAGC 1417 
I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

901 T^GAGATACAAGTCCGCGGCAGTGGCTGCCAGTGTCATCCTGCGCTCCCACCCGCTCAGC 960 

1418 CCCACACAGCGGCTGGTGGGCTGGATTGACCACGTCCTCCAGACAGGGGGCGCGACGCAC 1477 

I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CCCACACAGCGGCTGGTGGGCTGGATTGACCACGTCCTCCAGACAGGGGGCGCGAGGCAC 1020 



961 



1537 



1478 CTCAAGCCCTATGTCTTTCAGCAGCCCTGGCATGAGCAGTACCTGTTCGACGTTTTTGTG 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I hi I I I I I I I I I I I I I I I I I I I I I I I I I 
1021 CTCAAGCCCTATGTCTTTCAGCAGCCCTGGCATGAGCAGTACCTGCTCGACGTTTTTGTG 108 0 

1538 TTTCTGCTGGGGCTCACTCTGGGGACTCTATGGCTTTGTGGGAAGCTGCTGGGCATGGCT 1597 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1081 TTTCTGCTGGGGCTCACTCTGGGGACTCTATGGCTTTGTGGGAAGCTGCTGGGCATGGCT 1140 



1598 



1639 



GTCTGGTGGCTGCGTGGGGCCAGAAAGGTGAAGGAGACATAA 
I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1141 GTCTGGTGGCTGCGTGGGGCCAGAAAGGTGAAGGAGACATAA 1182 
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LOCUS 
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ACCESSION 
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KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
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TITLE 
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PUBMED 

REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
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AK050128 2197 bp itiRNA linear HTC 03-APR-2004 

Mus musculus adult male liver tumor cDNA, RIKEN full-length 
enriched library, clone : C730018P11 product : hypothetical 
UDP-glucoronosyl and UDP-glucosyl transferase containing protein, 
full insert sequence. 
AK050128 

AK050128.1 GI: 26340835 
HTC; CAP trapper. 
Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 
1 

Carninci,P. and Hayashizaki, Y. 

High-efficiency full-length cDNA cloning 

Meth. Enzymol. 303, 19-44 (1999) 

99279253 

10349636 

2 

Carninci,P., Shibata,Y., Hayatsu,N., Sugahara,Y., Shibata,K., 
Itoh,M., Konno,H., Okazaki,Y., Muramatsu,M. and Hayashizaki, Y. 

Normalization and subtraction of cap-trapper-selected cDNAs to 

prepare full-length cDNA libraries for rapid discovery of new genes 

Genome Res. 10 (10), 1617-1630 (2000) 

20499374 

11042159 

3 

Shibata, K. , 



Itoh,M., Aizawa,K. 
Konno,H., Akiyama,J., Nishi,K. 



Nagaoka,S., Sasaki, N., Carninci,P. 
Kitsunai,T., Tashiro,H., Itoh,M., 



Sumi^N., Ishii^Y., Nakamura, S . , Hazama^M., Nishine^T., Harada^A. , 
Yamamoto, R. , Matsumoto, H..., Sakaguchi, S . , Ikegami, T . , . Kashiwagi, K. , 
Fujiwake,S., Inoue,K., Togawa^Y., Izawa^M., Ohara,E., Watahiki,M., 
Yoneda^Y., Ishikawa^T., Ozawa^K., Tanaka^T., Matsuura^S.^ Kawai,J., 
Okazaki,Y.^ Muramatsu,M. , Inoue,Y., Kira^A. and Hayashizaki, Y. 

TITLE RIKEN integrated sequence analysis (RISA) system — 384-fontiat 

sequencing pipeline with 384 multicapillary sequencer 

JOURNAL Genome Res. 10 (11), 1757-1771 (2000) 

MEDLINE 20530913 
PUBMED 11076861 
REFERENCE 4 

AUTHORS The RIKEN Genome Exploration Research Group Phase II Team and the 

FANTOM Consortium. 
TITLE Functional annotation of a full-length mouse cDNA collection 

JOURNAL Nature 409, 685-690 (2001) 
REFERENCE 5 

AUTHORS The FANTOM Consortium and the RIKEN Genome Exploration Research 
Group Phase I & II Team. 

TITLE Analysis of the mouse transcriptome based on functional annotation 

of 60,770 full-length cDNAs 

JOURNAL Nature 420, 563-573 (2002) 
REFERENCE 6 (bases 1 to 2197) 

AUTHORS Adachi,J., Aizawa^K., Akimura,T., Arakawa,T., Bono,H., Carninci^P., 
Fukuda,S., Furuno,M. , Hanagaki^T., Kara, A., Hashizume, W. , 
Hayashida,K. , Hayatsu^N., Hiramoto^K., Hiraoka^T., Hirozane,T., 
Hori,F., Imotani,K., Ishii^Y., Itoh,M., Kagawa,I., Kasukawa^T., 
Katoh^H., Kawai,J., Kojima^Y., Kondo^S., Konno^H., Kouda,M. , 
Koya,S., Kurihara^C, Matsuyama, T. , Miyazaki^A. , Murata,M., 
Nakamura,M., Nishi^K., Nomura, K., Numazaki,R., Ohno,M., Ohsato,N,, 
Okazaki,Y., Saito,R., Saitoh, H., Sakai,C., Sakai,K., Sakazume,N., 
Sano,H., Sasaki, D., Shibata,K., Shinagawa,A. , Shiraki,T., 
Sogabe,Y., Tagami,M., Tagawa,A. , Takahashi, F. , Takaku-Akahira, S . , 
Takeda,Y., Tanaka,T., Tomaru,A. , Toya,T., Yasunishi, A. , 
Muramatsu,M. and Hayashizaki, Y. 

TITLE Direct Submission 

JOURNAL Submitted (16-JUL-2001) Yoshihide Hayashizaki, The Institute of 
Physical and Chemical Research (RIKEN) , Laboratory for Genome 
Exploration Research Group, RIKEN Genomic Sciences Center (GSC) , 
RIKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 
Kanagawa 230-0045, Japan (E-mail : genome-res@gsc. riken. jp, 
URL: http: //genome. gsc. riken. jp/, Tel : 81-45-503-9222, 
Fax:81-45-503-9216) 
COMMENT cDNA library was prepared and sequenced in Mouse Genome 

Encyclopedia Project of Genome Exploration Research Group in Riken 
Genomic Sciences Center and Genome Science Laboratory in RIKEN. 
Division of Experimental Animal Research in Riken contributed to ^ 
prepare mouse tissues. 

Tissue was provided by William A. Held, Roswell Park Cancer 
Institute, Department of Molecular and Cellular Biology, Elm and 
Carlton Streets, Buffalo, NY 14263, whose assistance we gratefully 
acknowledge. 

Please visit our web site for further details. 
URL: http: / /genome . gsc . riken. jp/ 
URL : http : / / f antom. gsc . riken . jp/ . 
FEATURES Location/Qualifiers 
source 1. .2197 

/organism="Mus musculus" 



/mol_t ype= "mRNA" 

/strain="C57BL/6J" 

/db^xr e f = " FANTOM^DB :C730018P11" 

/db_xref =" taxon : 10090" 

/clone="C730018Pll" 

/sex="male" 

/ eel l_type=" tumor" 

/tissue_type=" liver" 

/clone_lib="RIKEN full-length enriched mouse cDNA library" 
/dev_stage=" adult" 
CDS 99. .1670 

/note="unnamed protein product; hypothetical 

UDP-glucoronosyl and UDP-glucosyl transferase containing 

protein (InterPro | IPR002213, evidence: InterPro) 

putative " 

/codon_start=l 

/protein_id="BAC34080. 1" 

/db_xref="GI: 26340836" 

/translation="l^AAHRRWLLMSFLFLEVILLEAAKILTISTLSASHYIVISRVSQ 
VLHEGGHNVTKLLYESANIPDFRKEKPSYQVINWRPPEDQEKKFADLRHRLTEEITYG 
RSKHHTLLKIHQYFGDLCSQLLSRKDIMDFLKNENFDLVLLDSMDLCSLLIVEKLGKR 
FVSFLPFQFSYMDFGLPSAPLSYAPVYGSGLTDQMDFWGRVKNFLMFLDFSMKQREIL 
SQYDSTIQEHFVEGSQPVLSDLLLKAELWFVNSDFALDFTVRPLFPNTVYVGGLLDKPV 
QPIPQDLENFISQFGDSGFVLVALGSIVSMIQSKEIIKEMNSAFAHLPQGVLWTCKTS 
HWPKDVSLAPNVKIMDWLPQTDLLAHPSIRLFVTHGGMNSVMEAVHHGVPMVGIPFFF 
DQPENMVRVEAKNLGVSIQLQTLKAESFALTMKKIIEDKRYKSAAMASKIIRHSHPLT 
PAQRLLGWIDHILQTGGAAHLKPYAFQQPWHEQYMLDVFLFLLGLMLGTLWLSVKVLV 
AVTRYLSIATKVKEA" 

polyA_signal 2178. .2183 

/note="putative" 

polyA_site 2197 

/note= "putative " 

ORIGIN 



Query Match 39.3%; Score 912.2; DB 3; Length 2197; 

Best Local Similarity 73.0%; Pred. No. 2.8e-247; 

Matches 1201; Conservative 0; Mismatches 438; Indels 6; Gaps 2; 

Qy 1 AGGGTCCCTTAGCCGGGCGCAGGGCGCGCAGCCCAGGCTGAGATCCGCGGCTTCCGTAGA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 32 AGGGTCCCTTAGTGGGGCACAGCTCTTTAGGTGTGCAACCTGTGTCAGGGCTCCACATAT 91 

Qy 61 AGTGAGCATGGCTGGGCAGCGAGTGCTTCTTCTAGTGGGCTTCCTTCTCCCTGGGGTCCT 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 92 TCTACAGATGGCAGCACATCGGCGTTGGCTTCTCATGAGCTTCCTTTTCCTTGAGGTTAT 151 

Qy 121 GCTCTCAGAGGCTGCCAAAATCCTGACAATATCTACAGTAGGTGGAAGCCATTATCTACT 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III II II I I II II II I 
Db 152 TCTCTTGGAGGCTGCAAAAATCCTGACTATATCTACACTGAGTGCAAGCCATTATATAGT 211 

Qy 181 GATGGACCGGGTTTCTCAGATTCTTCAAGATCACGGTCATAATGTCACCATGCTTAACCA 240 

III I II I I I I I I I I I I I II I I I I II I I I I I I I I I III I 

Db 212 GATAAGCCGTGTGTCGCAAGTTCTTCATGAAGGTGGCCACAATGTGACCAAACTTCTTTA 271 



Qy 

Db 



241 
272 



CAAAAGAGGTCCTTTTATGCCAGATTTTAAAAAGGAAGAAAAATCATATCAAGTTATCAG 

III II I I I II I I II I I II I I I I I II II II I I I I I I I I I I I I I I I I 
TGAAA GTGCTAATATCCCAGATTTTAGAAAGGAAA/SiACCATCATATCAAGTTATTAA 



300 
328 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



301 TTGGCTTGCACCTGAAGATCATCAAAGAGAATTTAAAAAGAGTTTTGATTTCTTTCTGGA 360 

I I I I I I I I I I I I I I I I I I I III I I I II I I II II II 

329 TTGGCGTCCACCTGAAGATCAGGAAAAGAAATTTGCTGATCTTAGGCATCGACTTACAGA 388 

361 AGAAACTTTAGGTGGCAGAGGAAAATTTGAAAACTTATTAAATGTTCTAGAATACTTGGC 420 

II I II I I I II I III I I I I I I I I I II I I I I I I I I 

38 9 AGAAATAACTTATGGCAGGTCCAAACATCACACCCTTCTAAAGATCCATCAATACTTTGG 448 

421 GTTGCAGTGCAGTCATTTTTTAAATAGAAAGGATATCATGGATTCCTTAAAGAATGAGAA 480 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 

449 GGACTTATGCAGTCAATTATTAAGCAGAAAGGACATCATGGACTTCTTAAAAAATGAGAA 508 

481 CTTCGACATGGTGATAGTTGAAACTTTTGACTACTGTCCTTTCCTGATTGCTGAGAAGCT 540 

I I I I I I I I I I I I I I I III I I I I I I II II I I I I I I I I II II 
509 CTTTGACCTAGTACTTCTTGATTCAATGGATCTCTGTTCTTTGCTAATTGTTGAAAAGCT 568 

541 TGGGAAGCCATTTGTGGCCATTCTTTCCACTTCATTCGGCTCTTTGGAATTTGGGCTACC 600 

I I I I I I I I I I I I M I I I I I I I I I I III I I I I I I I I I M I I I I I M 

569 TGGGAAACGATTTGTGTCCTTTCTTCCCTTTCAATTTAGCTATATGGACTTTGGGTTACC 628 

601 AA TCCCCTTGTCTTATGTTCCAGTATTCCGTTCCTTGCTGACTGATCACATGGACTT 657 

II I I I I II I I I I M I I I I I I I I I II I I I I I I I I I I I I I I I I I I 

629 AAGCGCCCCCTTGTCCTATGCTCCAGTGTATGGTTCTGGTCTAACTGACCAAATGGACTT 688 

658 CTGGGGCCGAGTGAAGAATTTTCTGATGTTCTTTAGTTTCTGCAGGAGGCAACAGCACAT 717 

II I I I I I I I I I I I I I I I I I I I I M I I II I I II I I I I I II II I I I I I I II 
689 CTGGGGCCGAGTGAAGAACTTTCTGATGTTCCTTGATTTCTCCATGAAGCAAAGGGAAAT 748 

718 GCAGTCTACATTTGACAACACCATCAAGGAACATTTCACAGAAGGCTCTAGGCCAGTTTT 777 

I III I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I II I I I I 
74 9 CCTTTCTCAGTATGACAGCACTATCCAGGAGCATTTTGTGGAAGGCTCTCAGCCAGTGTT 808 

77 8 GTCTCATCTTCTACTGAAAGCAGAGTTGTGGTTCATTAACTCTGACTTTGCCTTTGATTT 837 

I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
809 GTCTGACCTTCTACTGAAAGCTGAGCTGTGGTTTGTCAACTCTGACTTTGCTTTGGATTT 868 

838 TGCTCGACCTCTGCTTCCCAACACTGTTTATGTTGGAGGCTTGATGGAAAAACCTATT7\A 897 
MINI II III I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I II I 

869 TGCTCGTCCCCTGTTTCCCAACACAGTCTATGTGGGAGGTTTACTGGACAAACCTGTTCA 928 

898 ACCAGTACCACAAGACTTGGAGAACTTCATTGCCAAGTTTGGGGACTCTGGTTTTGTCCT 957 

III I I I I I I I I II I I I I I I I I II II I I I II I I I I I I I I I I I I I I I I I I I 
929 GCCAATACCCCAAGACTTGGAGAATTTTATCTCTCAGTTTGGAGACTCAGGTTTTGTCCT 988 

958 TGTGACCTTGGGCTCCATGGTGAACACCTGTCAGAATCCGGT^AATCTTCAAGGAGATGAA 1017 

I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I M I 

989 TGTGGCCCTGGGCTCTATAGTGAGCATGATTCAGTCCAAGGAAATTATTAAGGAGATGAA 1048 

10 1 8 CAATGCCTTTGCTCACCTACCCCAAGGGGTGATATGGAAGTGTCAGTGTTCTCATTGGCC 1077 

II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I III II I I I I I I I I 1 I I I 
1049 CAGTGCCTTTGCTCACCTCCCTCAAGGGGTGCTATGGACATGTAAGACTTCTCATTGGCC 1108 

1078 CAAAGATGTCCACCTGGCTGCAAATGTGAAAATTGTGGACTGGCTTCCTCAGAGTGACCT 1137 

I I I I I I I I I I I I I I I I I I I I I I I I I I I IN I I I I I I I I I I I I I I I I I I 

1109 CAAAGATGTCAGTTTGGCCCCAAATGTCAAAATCATGGATTGGCTTCCACAGACTGACCT 1168 



Qy 1138 CCTGGCTCACCCAAGCATCCGTCTGTTTGTCACCCACGGCGGGCAGAATAGCATAATGGA 1197 

II I II I I I M I I I I I II I I I M I I I I I I I |..| I I I I I I I I I I I I I I I I ^ 
Db 1169 TCTAGCTCACCCTAGCATTCGTCTGTTTGTCACTCATGGAGGGATGAACAGTGTGATGGA 1228 

Qy 1198 GGCCATCCAGCATGGTGTGCCCATGGTGGGGATCCCTCTCTTTGGAGACCAGCCTGAAAA 1257 

III I I I I I II I I I I I I I I I I II I I II I I I I I I I I II I II I I I I I I I I 
Db 1229 GGCTGTCCATCATGGAGTACCCATGGTGGGGATTCCATTTTTTTTTGACCAACCTGAAAA 1288 

Qy 1258 CATGGTCCGAGTAGAAGCCAAAAAGTTTGGTGTTTCTATTCAGTTAAAGAAGCTCAAGGC 1317 

I I I I I I I I I I I I I I I I I I II II I I I I I I i I I I I I I I I I I II III I I I I I I I I I 
Db 1289 CATGGTCCGAGTAGAAGCAAAGAACCTTGGTGTTTCTATTCAGCTACAGACGCTCAAGGC 1348 

Qy 1318 AGAGACATTGGCTCTTAAGATGAAACAAATCATGGAAGACAAGAGATACAAGTCCGCGGC 1377 

I I I I I I I I II II I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I II II 
Db 1349 AGAGTCATTTGCGCTCACCATGAAAAAAATCATAGAAGACAAGAGGTACAAGTCTGCAGC 1408 

Qy 1378 AGTGGCTGCCAGTGTCATCCTGCGCTCCCACCCGCTCAGCCCCACACAGCGGCTGGTGGG 1437 

I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1409 AATGGCCTCCAAGATT ATCAGGCACTCCCACCCACTGACCCCTGCCCAGAGGCTTTTGGG 1468 

Qy 1438 CTGGATTGACCACGTCCTCCAGACAGGGGGCGCGACGCACCTCAAGCCCTATGTCTTTCA 1497 

I I I I I I M I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1469 CTGGATAGATCATATCTTGCAGACAGGGGGTGCAGCACATCTCAAGCCATATGCTTTCCA 1528 

Qy 1498 GCAGCCCTGGCATGAGCAGTACCTGTTCGACGTTTTTGTGTTTCTGCTGGGGCTCACTCT 1557 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I II 
Db 1529 GCAGCCATGGCATGAGCAGTACATGCTTGATGTCTTCCTCTTTCTCCTAGGGCTCATGCT 1588 

Qy 1558 GGGGACTCTATGGCTTTGTGGGAAGCTGCTGGGCATGGCTGTCTGGTGGCTGCGTGGGGC 1617 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 1589 GGGTACTTTGTGGCTTAGTGTAAAGGTTCTTGTTGCTGTAACCAGGTATCTGAGTATAGC 1648 

Qy 1618 CAGAAAGGTG7\AGGAGACATAAGGC 1642 

I I I I I I I I I I I I I I I I I I I 
Db 1649 AACGAAGGTCAAGGAGGCATAATGC 1673 



RESULT 3 
AY404344 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 
JOURNAL 



AY404344 1180 bp DNA linear GSS 12-DEC-2003 

Pan troglodytes HCM1859 gene, VIRTUAL TRANSCRIPT, partial sequence, 
genomic survey sequence. - 
AY404344 

AY404344.1 GI: 39760321 
GSS. 

Pan troglodytes (chimpanzee).,. 
Pan troglodytes 
Eukaryota; Metazoa; Chorda ta; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 1180) 

Clark,A.G., Glanowski, S . , Nielson,R., Thomas, P., Ke jariwal, A. , 
Todd, M. A., Tanenbaum, D.M. , Civello, D . R. , Lu,F., Murphy, B., 
Ferriera,S., Wang,G., Zheng, X.H., White, T. J., Sninsky, J. J. , 
Adams, M.D. and Cargill,M. 

Inferring nonneutral evolution from human- chimp -mouse orthologous 
gene trios 

Science 302 (5652), 1960-1963 (2003) 



Craniata; Vertebrata; Euteleostomi; 
Catarrhini; Hominidae; Pan. 



14671302 

2 (bases 1 to 1180) 

Clark, A. G., Glanowski, S . , Nielson,R., Thomas, P., Kejariwal, A. , 
Todd, M. A., Tanenbaum, D.M. , Civello,D.R., Lu,F., Murphy, B., 
Ferriera,S., Wang,G., Zheng, X.H., White, T. J., Sninsky, J. J. , 
Adams, M.D. and Cargill,M. 

Direct Submission 

Submitted (16-NOV-2003) Celera Genomics, 45 West Gude Drive, 
Rockville, MD 20850, USA 

This sequence was made by sequencing genomic exons and ordering 
them based on alignment. 

Location/Qualifiers 
1. .1180 
/organism="Pan troglodytes" 
/mol_type=" genomic DNA" 
/db_xref="taxon: 9598" 
gene <1. .>1180 

/locus_tag="HCM1859" 

ORIGIN 



PUBMED 
REFERENCE 
AUTHORS 



TITLE 
JOURNAL 

COMMENT 

FEATURES 

source 



Query Match 37.0%; 
Best Local Similarity 73.0%; 
Matches 861; Conservative 



Score 857.4; DB 9; Length 1180; 
Pred. No. 8.5e-232; 
0; Mismatches 319; Indels 0; 



Gaps 



0; 



Qy 

Db 



460 GGATTCCTTAAAGAATGAGAACTTCGACATGGTGATAGTTGAAACTTTTGACTACTGTCC 519 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I 
1 GGATTCCTTAAAGAATGAGAACTTCGACATGGTGATAGTTGAAACTTTTGACTACTGTCC 60 



Qy 

Db 



520 TTTCCTGATTGCTGAGAAGCTTGGGAAGCCATTTGTGGCCATTCTTTCCACTTCATTCGG 579 
I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 
61 TTTCCTGATTGCTGAGAAGCTTGGGAAGCCATTTGTGGCCATTCTTTCCACTTCATTCGG 120 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



580 



121 



640 



181 



700 



241 



760 



301 



820 



361 



CTCTTTGGAATTTGGGCTACCAATCCCCTTGTCTTATGTTCCAGTATTCCGTTCCTTGCT 639 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CTCTTTGGAATTTGGGCTACCAATCCCCTTGTCTTATGTTCCAGTATTCCGTTCCTTGCT 180 

GACTGATCACATGGACTTCTGGGGCCGAGTGAAGAATTTTCTGATGTTCTTTAGTTTCTG 699 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ri I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

GACTGATCACATGGACTTCTGGGGCCGAGTGAAGAATTTTCTGATGTTCTTTAGTTTCTG 240 



CAGGAGGCAACAGCACATGCAGTCTACATTTGACAACACCATCAAGGAACATTTCACAGA 

I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CAGGAGGCAATGGGACATGCAGTCTACATTTGACAACACCATCAAGGAGCATTTCCCAGA 



759 



300 



819 



AGGCTCTAGGCCAGTTTTGTCTCATCTTCTACTGAAAGCAGAGTTGTGGTTCATTAACTC 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I \IU I I I I I I I I I I I I I I I 
AGGCTCTAGGCCAGTTTTGTCTCATCTTCTACTGAAAGCAGAGTTGTGGTNNNTTAACTC 360 

TGACTTTGCCTTTGATTTTGCTCGACCTCTGCTTCCCAACACTGTTTATGTTGGAGGCTT 879 

II II I I II II I I II I II 

TGNNNTTGCCTTTGATTTTGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 420 



880 GATGGAAAAACCTATTAAACCAGTACCACAAGACTTGGAGAACTTCATTGCCAAGTTTGG 939 
421 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 480 



Qy 



940 



GGACTCTGGTTTTGTCCTTGTGACCTTGGGCTCCATGGTGAACACCTGTCAGAATCCGGA 999 



Db 



481 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 540 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



1000 AATCTTCAAGGAGATGAACAATGCCTTTGCTCACCTACCCCAAGGGGTGATATGGAAGTG 1059 

541 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 600 

1060 TCAGTGTTCTCATTGGCCCAAAGATGTCCACCTGGCTGCAAATGTGAAAATTGTGGACTG 1119 

601 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 660 

1120 GCTTCCTCAGAGTGACCTCCTGGCTCACCCAAGCATCCGTCTGTTTGTCACCCACGGCGG 1179 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
NNNNNNNNNNNNNNNNNNNNNNNCTCACCCAAGCATCCGTCTGTTTGTCACCCACGGCGG 



661 



720 



1180 GCAGAATAGCATAATGGAGGCCATCCAGCATGGTGTGCCCATGGTGGGGATCCCTCTCTT 1239 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GCAGAATAGCATAATGGAGGCCATCCAGCATGGTGTGCCCATGGTGGGGATCCCTCTCTT 



721 



780 



124 0 TGGAGACCAGCCTGAAAACATGGTCCGAGTAGi\AGCCAAAAAGTTTGGTGTTTCTATTCA 1299 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I 
781 TGGAGACCAGCCTGAAAACATGGTCCGAGTAGAAGCCAAAAAGTTTGGTGTTTCTATTCA 840 

1300 GTTAAAGAAGCTCAAGGCAGAGACATTGGCTCTTAAGATGAAACAAATCATGGAAGACAA 1359 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
841 GTTAAAGAAGCTCAAGGCAGAGACATTGGCTCTTAAGATGAAACAAATCATGGAAGACAA 900 

1360 GAGATACAAGTCCGCGGCAGTGGCTGCCAGTGTCATCCTGCGCTCCCACCCGCTCAGCCC 1419 

I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

GAGATACAAGTCCGCGGCAGTGGCTGCCAGTGTCATCCTGCGCTCCCACCCNNNNAGCCC 



901 



960 



1420 CACACAGCGGCTGGTGGGCTGGATTGACCACGTCCTCCAGACAGGGGGCGCGACGCACCT 1479 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CACACAGCGGCTGGTGGGCTGGATTGACCACGTCCTCCAGACAGGGGGCGCGACGCACCT 1020 



961 



1539 



1480 CAAGCCCTATGTCTTTCAGCAGCCCTGGCATGAGCAGTACCTGTTCGACGTTTTTGTGTT 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1021 CAAGCCCTATGTCTTTCAGCAGCCCTGGCATGAGCAGTACCTGCTCGACGTTTTTGTGTT 1080 

1540 TCTGCTGGGGCTCACTCTGGGGACTCTATGGCTTTGTGGGAAGCTGCTGGGCATGGCTGT 1599 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I 
1081 TCTGCTGGGGCTCACTCTGGGGACTCTATGGCTTTGTGGGAAGCTGCTGGGCATGGCTGT 1140 

1600 CTGGTGGCTGCGTGGGGCCAGAAAGGTGAAGGAGACATAA 1639 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1141 CTGGTGGCTGCGTGGGGCCAGAAAGGTGAAGGAGACATAA 1180 



RESULT 4 
BQ216829 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 



BQ216829 879 bp mRNA linear EST 02-MAY-2002 

AGENCOURT_7594754 NIH_MGC_92 Homo sapiens cDNA clone IMAGE: 6043558 
5', rtiRNA sequence. 
BQ216829 

BQ216829.1 GI: 20398229 
EST. 

Homo sapiens (hioman) 



ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



ORIGIN 



Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 879) 

NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 

Unpublished (1999) 

Contact: Robert Strausberg, Ph.D. 

Email : cgapbs-r@mail . nih . gov 

Tissue Procurement: ATCC 
cDNA Library Preparation: Life Technologies, Inc. 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Agencourt Bioscience Corporation 
Clone distribution: MGC clone distribution information can be 

found through the I.M.A.G.E. Consortium/ LLNL at: 

http : / / image . llnl . gov 

Plate: LLAM13284 row: 1 column: 23 

High quality sequence start: 17 

High quality sequence stop: 671. 
Location/Qualifiers 
1. .879 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon:9606" 
/ cl one= " IMAGE :6043558" 

/tissue_type="erabryonal carcinoma, cell line" 
/iab_host="DH10B (phage-resistant) " 
/ cl one_l ib= " N I H_MGC_9 2 " 

/note="Organ: testis; Vector: pCMV-SP0RT6; Site_l: NotI; 
Site_2: Sail; Cloned unidirectionally; oligo-dT primed. 
Average insert size 2-5 kb. Library enriched for 
full-length clones and constructed by Life Technologies. 
Note: this is a NIH_MGC Library." 



Query Match 36.8%; 
Best Local Similarity 99.0%; 
Matches 869; Conservative 



Score 853.2; DB 5; 
Pred. No. 1..2e-230; 
0; Mismatches 8; 



Length 879; 



Indels 



1; Gaps 



1; 



Qy 

Db 



780 CTCATCTTCTACTGAAAGCAGAGTTGTGGTTCATTAACTCTGACTTTGCCTTTGATTTTG 839 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
1 CTCATCTTCTACTGAAAGCAGAGATGTGGTTCATTAACTCTGACTTTGCCTTTGATTTTG . 60 



Qy 

Db 

Qy 

Db 

Qy 

Db 



840 CTCGACCTCTGCTTCCCAACACTGTTTATGTTGGAGGCTTGATGGAAAAACCTATTAAAC 899 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
61 CTCGACCTCTGCTTCCCAACACTGTTTATGTTGGAGGCTTGATGGAAAAACCTATTAAAC 120 

900 CAGTACCACAAGACTTGGAGAACTTCATTGCCAAGTTTGGGGACTCTGGTTTTGTCCTTG 959 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
121 CAGTACCACAAGACTTGGAGAACTTCATTGCCAAGTTTGGGGACTCTGGTTTTGTCCTTG 180 

960 TGACCTTGGGCTCCATGGTGAACACCTGTCAGAATCCGGAAATCTTCAAGGAGATGAACA 1019 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
181 TGACCTTGGGCTCCATGGTGAACACCTGTCAGAATCCGGAAATCTTCAAGGAGATGAACA 240 



Qy 1020 



ATGCCTTTGCTCACCTACCCCAAGGGGTGATATGGAAGTGTCAGTGTTCTCATTGGCCCA 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



1079 



Db 



241 ATGCCTTTGCTCACCTACCCCAAGGGGTGATATGGAAGTGTCAGTGTTCTCATTGGCCCA 300 



Qy 1080 AAGATGTCCACCTGGCTGCAAATGTGAAAATTGTGGACTGGCTTCCTCAGAGTGACCTCC 1139 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I M I I I I I I I I I I I 
Db 301 AAGATGTCCACCTGGCTGCAAATGTGAAAATTGTGGACTGGCTTGCTCAGAGTGACCTCC 360 

Qy 1140 TGGCTCACCCAAGCATCCGTCTGTTTGTCACCCACGGCGGGCAGAATAGCATAATGGAGG 1199 

I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I 
Db 361 TGGCTCACCCAAGCATCCGTCTGTTTGTCACCCACGGCGGGCAGAATAGCATAATGGAGG 420 

Qy 1200 CCATCCAGCATGGTGTGCCCATGGTGGGGATCCCTCTCTTTGGAGACCAGCCTGAAAACA 1259 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 CCATCCAGCATGGTGTGCCCATGGTGGGGATCCCTCTCTTTGGAGACCAGCCTGAAAACA 480 

Qy 1260 TGGTCCGAGTAGAAGCCAAAAAGTTTGGTGTTTCTATTCAGTTAAAGAAGCTCAAGGCAG 1319 

I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I 11 I I I I I I I I I I I M 
Db 481 TGGTCCGAGTAGAAGCCAAAAAGTTTGGTGTTTCTATTCAGTTAAAGAAGCTCAAGGCAG 540 

Qy 1320 AGACATTGGCTCTTAAGATGAAACAAATCATGGAAGACAAGAGATACAAGTCCGCGGCAG 1379 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I 
Db 541 AGACATTGGCTCTTAAGATGAAACAAATCATGGAAGACAAGAGATACAAGTCCGCGGCAG 600 

Qy 1380 TGGCTGCCAGTGTCATCCTGCGCTCCCACCCGGTCAGCCCCACACAGCGGCTGGTGGGCT 1439 

I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 601 TGGCTGCCAGTGTCATCCTGCGCTCCCACCCGCTCAGCCCCACACAGCGGCTGGTGGGCT 660 

Qy 1440 GGATTGACCACGTCCTCCAGACAGGGGGCGCGACGCACCTCAAGCCCTATGTCTTTCAGC 1499 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I M I I I I I 
Db 661 GGATTGACCACGTCCTCCAGACAGGGGGCGCGACGCACCTCAAGCCCTATGTCTTTCAGC 720 

Qy 1500 AGCCCTGGCATGAGCAGTACCTGTTCGACGTTTTTGTGTTTCTGCTGGGGCTCACTCTGG 1559 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 
Db 721 AGCCCTGGCATGAGCAGTACCTGCTCGACGTTTTTGTGTTTCTGCTGGGGCTCACTCTGG 780 

Qy 1560 GGACTCTATGGCTTTGT-GGGAAGCTGCTGGGCATGGCTGTCTGGTGGCTGCGTGGGGCC 1618 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 
Db 781 GGACTCTATGGCTTTGTGGGGAAGCTGCTGGGCATGGCTGTCTGGTGGCTGCTTGGGGCC 840 

Qy 1619 AGAAAGGTGAAGGAGACATAAGGCCAGGTGCAGCCTTG 1656 

I I I I I I I I I I I I I I I III II I I II II II I II I I 
Db 841 ACAAAGGGGAAGGAGACCTAAAGCCAGGTGCACCCTTG 878 



RESULT 5 
CN396938 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



759 bp mRNA 
GRN_EB Homo satpiens cDNA 



CN396938 
17000417785787 
CN396938 

CN396938 .1 GI : 47384533 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 759) 
Brandenberger, R. , Wei^H., Zhang, S 
Li,Y., Xu,C., Fang,R., Guegler,K. 



linear... EST 16-MAY-2004 
, mRNA sequence. 



Craniata ; Vertebrata ; Euteleos tomi ; 
Catarrhini; Hominidae; Homo. 



, Lei^S., Murage, J., FisJc^G.J. 
Rao,M. S. , Mandalam, R. , 



TITLE 

JOURNAL 
COMMENT 



FEATURES 

source 



Lebkowski,J and Stanton^ L.W. 

Trans crip tome characterization elucidates signaling networks that 

control human ES cell growth and differentiation 

Nat. Biotechnol. 22 (6), 707-716 (2004) 

Contact: Brandenberger R 

Regenerative Medicine 

Geron Corporation 

230 Constitution Drive, Menlo Park, CA 94025, USA 

Tel: 650 473 8658 

Fax: 650 473 7760 

Email : rbrandenberger@geron . com 

Insert Length: 759 Std Error: 0.00. 

Location/Qualifiers 

1. .759 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon: 9606" 

/tissue_type="embryonic stem cells, embryoid bodies 
derived from HI, H7 and H9 cells" 
/clone_lib="GRN_EB" 

/note="oligo dT primed, full-length enriched cDNA library 
from embryoid body outgrowths derived from hES cell lines 
HI (p32), H7 (p29), and H9 (p26) maintained in feeder-free 
conditions . " 



ORIGIN 



Query Match 32.5%; Score 754; DB 7; Length 759; 

Best Local Similarity 100.0%; Pred. No. 1.9e-202; 

Matches 754; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 AGGGTCCCTTAGCCGGGCGCAGGGCGCGCAGCCCAGGCTGAGATCCGCGGCTTCCGTAGA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I M I I I I I I I I I I I I I I 
Db 6 AGGGTCCCTTAGCCGGGCGCAGGGCGCGCAGCCCAGGCTGAGATCCGCGGCTTCCGTAGA 65 

Qy 61 AGTGAGCATGGCTGGGCAGCGAGTGCTTCTTCTAGTGGGCTTCCTTCTCCCTGGGGTCCT 120 

M I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I M I I I I I I 
Db 66 AGTGAGCATGGCTGGGCAGCGAGTGCTTCTTCTAGTGGGCTTCCTTCTCCCTGGGGTCCT 125 



Qy 121 GCTCTCAGAGGCTGCCAAAATCCTGACAATATCTACAGTAGGTGGAAGCCATTATCTACT 180 

I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 126 GCTCTCAGAGGCTGCCAAAATCCTGACAATATCTACAGTAGGTGGAAGCCATTATCTACT 185 



Qy 181 GATGGACCGGGTTTCTCAGATTCTTCAAGATCACGGTCATAATGTCACCATGCTTAACCA 240 

I I I I I I I I I I I I I M I M I I 11 I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I M 

Db 186 GATGGACCGGGTTTCTCAGATTCTTCAAGATCACGGTCATAATGTCACCATGCTTAACCA 245 

Qy 241 CAAAAGAGGTCCTTTTATGCCAGATTTTAAAAAGGAAGAAAAATCATATCAAGTTATCAG 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 246 CAAAAGAGGTCCTTTTATGCCAGATTTTA/SiAAAGGAAGAAAAATCATATCAAGTTATCAG 305 

Qy 301 TTGGCTTGCACCTGJ^AGATCATCAAAGAGAATTTAAAAAGAGTTTTGATTTCTTTCTGGA 360 

I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I 
Db 306 TTGGCTTGCACCTGAAGATCATCAAAGAG7\ATTTAAAAAGAGTTTTGATTTCTTTCTGGA 365 



Qy 361 AGAAACTTTAGGTGGCAGAGGAAAATTTGAAAACTTATTAAATGTTCTAGAATACTTGGC 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I 
Db 366 AGAAACTTTAGGTGGCAGAGGAAAATTTGATVAACTTATTAAATGTTCTAGAATACTTGGC 425 



Qy 

Db 



421 GTTGCAGTGCAGTCATTTTTTAAATAGAAAGGATATCATGGATTCCTTAAAGAATGAGAA 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I 
426 GTTGCAGTGCAGTCATTTTTTAAATAGAAAGGATATCATGGATTCCTTAAAGAATGAGAA 485 



Qy 481 CTTCGACATGGTGATAGTTGAAACTTTTGACTACTGTCCTTTCCTGATTGCTGAGAAGCT 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 486 CTTCGACATGGTGATAGTTGAAACTTTTGACTACTGTCCTTTCCTGATTGCTGAGAAGCT 545 

Qy 541 TGGGAAGCCATTTGTGGCCATTCTTTCCACTTCATTCGGCTCTTTGGAATTTGGGCTACC 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 546 TGGGAAGCCATTTGTGGCCATTCTTTCCACTTCATTCGGCTCTTTGGAATTTGGGCTACC 605 

Qy 601 AATCCCCTTGTCTTATGTTCCAGTATTCCGTTCCTTGCTGACTGATCACATGGACTTCTG 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I M I I I I I I I I I I I I I I I I I 

Db 606 AATCCCCTTGTCTTATGTTCCAGTATTCCGTTCCTTGCTGACTGATCACATGGACTTCTG 665 

Qy 661 GGGCCGAGTGAAGAATTTTCTGATGTTCTTTAGTTTCTGCAGGAGGCAACAGCACATGCA 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 666 GGGCCGAGTGAAGAATTTTCTGATGTTCTTTAGTTTCTGCAGGAGGCAACAGCACATGCA 725 

Qy 721 GTCTACATTTGACAACACCATCAAGGAACATTTC 754 

I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I 
Db 726 GTCTACATTTGACAACACCATCAAGGAACATTTC 759 



RESULT 6 
CD050395 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



CD050395 916 bp rtiRNA linear EST 09-MAY-2003 

AGENCOURT_13977999 NIH_MGC_173 Homo sapiens cDNA 5', mRNA sequence. 

CD050395 

CD050395.1 GI:30487290 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

Mammalia;. Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 916) 

NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Healthy Mammalian Gene Collection (MGC) 

Unpublished (1999) 

Contact: Robert Strausberg, Ph.D. 

Email : cgapbs-r(?mail . nih . gov 

Tissue Procurement: Dr. Jamie Thompson, University of WI 
cDNA Library Preparation: Gina Zastrow-Hayes 
cDNA Library Arrayed by: The I.M.A.G.E. Consortiiun (LLNL) 
DNA Sequencing by: Agencourt Bioscience Corporation 
Clone distribution: MGC clone distribution information can be 

found through the I.M.A.G.E. Consortium/ LLNL at: 

http: //image. llnl.gov 

Plate: NDKM37 row: m column: 04 

High quality sequence start: 12 

High quality sequence stop: 721. 
Location/Qualifiers 
1. .916 

/organism="Homo sapiens" 
/mol_type="mRNA" 



/db_xref="taxon:9606" 

/tissue_type="entbryonic trophoblasts^ made from WAOl. stem 
cells" 

/lab_host="DH10B TonA" 
/ clone_l ib= "NX H_MGC_1 73" 

/note= "Vector: pDONR201; Site_l: attP2; Site_2: attPl; 
LIBR_PRIMING - oligo dT; METHOD - full-length enriched; 
LIBR PROVIDER - Bradfield" 



ORIGIN 



Query Match 31.8%; Score 738; DB 6; Length 916; 

Best Local Similarity 95.0%; Pred. No. 7.2e-198; 

Matches 795; Conservative 0; Mismatches 36; Indels 6; Gaps 3; 

Qy 1 AGGGTCCCTTAGCCGGGCGCAGGGCGCGCAGCCCAGGCTGAGATCCGCGGCTTCCGTAGA 60 

I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I M I I i I I I I I I I I I I I I I I I I I I I I I I I I 
Db 54 AGGGTCCCTTAGCCGGGCGCAGGGCGCGCAGCCCAGGCTGAGATCCGCGGCTTCCGTAGA 113 

Qy 61 AGTGAGCATGGCTGGGCAGCGAGTGCTTCTTCTAGTGGGCTTCCTTCTCCCTGGGGTCCT 120 

I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I .I I I I I I I I I I I I 

Db 114 AGTGAGCATGGCTGGGCAGCGAGTGCTTCTTCTAGTGGGCTTCCTTCTCCCTGGGGTCCT 173 

Qy 121 GCTCTCAGAGGCTGCCAAAATCCTGACAATATCTACAGTAGGTGGAAGCCATTATCTACT 180 

II I II I I II II II II II I II I II II II II I II I I I II II I II I II II II I II i II I I I II 

Db 174 GCTCTCAGAGGCTGCCAAAATCCTGACAATATCTACAGTAGGTGGAAGCCATTATCTACT 233 

Qy 181 GATGGACCGGGTTTCTCAGATTCTTCAAGATCACGGTCATAATGTCACCATGCTTAACCA 240 

II I I I I I I I I I M II I I I II I I I II I I I I M I I I I I II M II I II II I I I I I I I 11 I I II 

Db 234 GATGGACCGGGTTTCTCAGATTCTTCAAGATCACGGTCATAATGTCACCATGCTTAACCA 293 

Qy 241 CAAAAGAGGTCCTTTTATGCCAGATTTTAAAAAGGAAGAAAAATCATATCAAGTTATCAG 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 294 CAAAAGAGGTCCTTTTATGCCAGATTTTAAAAAGGAAGAAAAATCATATCAAGTTATCAG 353 

Qy 301 TTGGCTTGCACCTGAAGATCATCAAAGAGAATTTAAAAAGAGTTTTGATTTCTTTCTGGA 360 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 354 TTGGCTTGCACCTGAAGATCATCAAAGAGAATTTAAAAAGAGTTTTGATTTCTTTCTGGA 413 

Qy 361 AGAAACTTTAGGTGGCAGAGGAAAATTTGAAAACTTATTAAATGTTCTAGAATACTTGGC 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I U I I I I I 
Db 414 AGAAACTTTAGGTGGCAGAGGAAAATTTGAAAACTTATTAAATGTTCTAGAATACTTGGC 473 

Qy 421 GTTGCAGTGCAGTCATTTTTTA/SATAGAAAGGATATCATGGATTCCTTAAAGAATGAGAA 480 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 474 GTTGCAGTGCAGTCATTTTTTAAATAGAAAGGATATCATGGATTCCTTAAAGAATGAGAA 533 

Qy 481 CTTCGACATGGTGATAGTTGAAACTTTTGACTACTGTCCTTTCCTGATTGCTGAGAAGCT 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 534 CTTCGACAT-GTGATAGTTGTWVCTTTTGACTACTGTCCTTTCCTGATTGCTGAGAAGCT 592 

Qy 541 TGGGAAGCCATTTGTGGCCATTCTTTCCACTTCATTCGGCTCTTTGGAATTTGGGCTACC 600 

I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I.I I I I I I I I I I I I I I I 
Db 593 TGGGAAGCCATTTGTGGCCATTCTTTCCACTTCATTCGGCTCTTTGGAATTTGGGCTACC 652 

Qy 601 AATCCCCTTGTCTTATGTTCCAGTATTCCGTTCCTTGCTGACTGATCACATGGACTTCTG 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 653 AATCCCCTTGTCTTATGTTCCAGTATTCCGTTCCTTGCTGACTGATCACATGGACTTCTG 712 



Qy . . 661 GGGCCGAGTGAAGAATTTTCTGATGTTCTTTAGTTTCTGCAGGAGGCAACAGCACATGCA 720 

I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I 
Db 713 GGGCCGAGTGAAGAATTTTCTGATNGTCTTTAGTTTCTGCAGG7\AGCAACAGCACATGCA 772 

Qy 721 GTCTACATTTGACAACACCATCAAGGAACA—TTTCACAGAAGGCTCTAGGCCAGTTTTG 778 

I I I I I I I I I I I I I I I I I I I I I I I I III I II I II I I I II I I I I I I I I III 

Db 773 GTCTACATTTGACAACACCATCCAAGGACATTTTTCACAGAGGGCTCTAGGGCCAGTTTT 832 

Qy 779 TCTC ATCTTCTACTGAAAGCAGAGTTGTGGTTCATTAACTCTGACTTTGCCTTT 832 

I I I I I I I I M II I II I MM II I I MM M 
Db 833 GTCCTCAATCTTCTACTGAAAAGCAAAAGTTGGTGGGTTCATTTAACCTCTGGACTT 889 



RESULT 7 
AY404345 
. LOCUS 
DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 

JOURNAL 
PUBMED 
REFERENCE 
AUTHORS 



TITLE 
JOURNAL 

COMMENT 

FEATURES 

source 



gene 
ORIGIN 



AY404345 1185 bp DNA linear GSS 12-DEC-2003 

Mus musculus HCM1859 gene, VIRTUAL TRANSCRIPT, partial sequence, 
genomic survey sequence. 
AY404345 

AY404345.1 GI:39760322 
GSS. 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 

1 (bases 1 to 1185) 

Clark, A. G.;, Glanowski, S . , Nielson,R., Thomas, P., Ke j ariwal, A. , 
Todd, M. A., Tanenbaum, D.M. , Civello, D. R. , Lu,F., Murphy, B., 
Ferriera,S., Wang,G., Zheng, X.H., White, T. J., Sninsky, J. J. , 
Adams, M.D. and Cargill,M. 

Inferring nonneutral evolution from human-chimp-mouse orthologous 
gene trios 

Science 302 (5652), 1960-1963 (2003) 
14671302 

2 (bases 1 to 1185) 

Clark, A. G., Glanowski, S . , Nielson,R., Thomas, P., Kejariwal, A. , 
Todd, M. A., Tanenbaum, D.M. , Civello, D. R. , Lu,F., Murphy, B., 
Ferriera,S., Wang,G., Zheng, X.H., White, T. J., Sninsky, J. J. , 
Adams, M.D. and Cargill,M. 
Direct Submission 

Submitted ( 16-NOV-2003) Celera Genomics, 45 West Gude Drive, 
Rockville, MD 20850, USA 

This sequence was made by sequencing genomic exons and ordering 
them based on alignment. 

Location/Qualifiers 

1. .1185 

/organism="Mus musculus" 
/mol_type="genomic DNA" 
/db_xref="taxon: 10090" 
<1. .>1185 
/locus_tag="HCM1859" 



Query Match 31.6%; Score 733.8; DB 9; 

Best Local Similarity 76.8%; Pred. No. 1.3e-196; 
Matches 910; Conservative 0; Mismatches 272; 



Length 1185; 
Indels 3; Gaps 



1; 



458 ATGGATTCCTTAAAGAATGAGAACTTCGACATGGTGATAGTTGAAACTTTTGACTACTGT -517 

I I I I I I I I I I I I I I I M I I I I M I I I I I I I I I I I I III I I II 

1 ATGGACTTCTTA/^AAAATGAGAACTTTGACCTAGTACTTCTTGATTCAATGGATCTCTGT 60 

518 CCTTTCCTGATTGCTGAGAAGCTTGGGAAGCCATTTGTGGCCATTCTTTCCACTTCATTC 577 

II II II II II I II I I I II I I II II I I I I I II I I I II I II II I I'M 

61 TCTTTGCTAATTGTTGAAAAGCTTGGGAAACGATTTGTGTCCTTTCTTCCCTTTCAATTT 120 

578 GGCTCTTTGGAATTTGGGCTACCAA TCCCCTTGTCTTATGTTCCAGTATTCCGTTCC 634 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I, I II I I I I I I I 

121 AGCTATATGGACTTTGGGTTACCAAGCGCCCCCTTGTCCTATGCTCCAGTGTATGGTTCT 180 

635 TTGCTGACTGATCACATGGACTTCTGGGGCCGAGTGAAGAATTTTCTGATGTTCTTTAGT 694 
II I I I II M I I I II I II I I I I I I II I M I I II I I I I II I II I I I I I I M I 

181 GGTCTAACTGACCAAATGGACTTCTGGGGCCGAGTGAAGAACTTTCTGATGTTCCTTGAT 240 

695 TTCTGCAGGAGGCAACAGCACATGCAGTCTACATTTGACAACACCATCAAGGAACATTTC 754 

I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I 

241 TTCTCCATGAAGCAAAGGGAAATCCTTTCTCAGTATGACAGCACTATCCAGGAGCATTTT 300 

755 ACAGAAGGCTCTAGGCCAGTTTTGTCTCATCTTCTACTGAAAGCAGAGTTGTGGTTCATT 814 
II I I I I I II I I I I I I I I I I I I I I I I M I II I I I I I I III I I I I I I I I 

301 GTGGAAGGCTCTCAGCCAGTGTTGTCTGACCTTCTACTGAAAGCTGAGCTGTGGTTTGTC 360 

815 AACTCTGACTTTGCCTTTGATTTTGCTCGACCTCTGCTTCCCAACACTGTTTATGTTGGA 874 

I I I I I I M I I I I I I II I I I I I I I I I I I II III I I I I I I I I I I II I I I I I III 

361 AACTCTGACTTTGCTTTGGATTTTGCTCGTCCCCTGTTTCCCAACACAGTCTATGTGGGA 420 

875 GGCTTGATGGAAAAACCTATTAAACCAGTACCACAAGACTTGGAGAACTTCATTGCCAAG 934 

I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I II 

421 GGTTTACTGGACAAACCTGTTCAGCCAATACCCCAAGACTTGGAGAATTTTATCTCTCAG 480 

935 TTTGGGGACTCTGGTTTTGTCCTTGTGACCTTGGGCTCCATGGTGAACACCTGTCAGAAT 994 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I II I I I I 

481 TTTGGAGACTCAGGTTTTGTCCTTGTGGCCCTGGGCTCTATAGTGAGCATGATTCAGTCC 540 

995 CCGGAAATCTTCAAGGAGATGAACAATGCCTTTGCTCACCTACCCCAAGGGGTGATATGG 1054 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I t I I I I I I I I 

541 AAGGAAATTATTAAGGAGATGAACAGTGCCTTTGCTCACCTCCCTCAAGGGGTGCTATGG 600 

1055 AAGTGTCAGTGTTCTCATTGGCCCAAAGATGTCCACCTGGCTGCAAATGTGAAAATTGTG 1114 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

601 ACATGTAAGACTTCTCATTGGCCCAAAGATGTCAGTTTGGCCCCAAATGTCAAAATCATG 660 

1115 GACTGGCTTCCTCAGAGTGACCTCCTGGCTCACCCAAGCATCCGTCTGTTTGTCACCCAC 1174 

II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

661 GATTGGCTTCCACAGACTGACCTTCTAGCTCACCCTAGCATTCGTCTGTTTGTCACTCAT 720 

1175 GGCGGGCAGAATAGCATAATGGAGGCCATCCAGCATGGTGTGCCCATGGTGGGGATCCCT 1234 
II I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 

721 GGAGGGATGAACAGTGTGATGGAGGCTGTCCATCATGGAGTACCCATGGTGGGGATTCCA 780 

1235 CTCTTTGGAGACCAGCCTGAAAACATGGTCCGAGTAGAAGCCAAAAAGTTTGGTGTTTCT 1294 
I III I I I II I I I II I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I 

781 TTTTTTTTTGACCAACCTGAAAACATGGTCCGAGTAGAAGCAAAGAACCTTGGTGTTTCT 840 



Qy 1295 ATTCAGTTAAAGAAGCTCAAGGCAGAGACATTGGCTCTTAAGATGAAACAAATCATGGAA 1354 

I I I I I I II III I I I I I I I I II II I I I I I II II I I I I I I I II I I I II III 
Db 841 ATTCAGCTACAGACGCTCAAGGCAGAGTCATTTGCGCTCACCATGAAAAAAATCATAGAA 900 

Qy 1355 GACAAGAGATACAAGTCCGCGGCAGTGGCTGCCAGTGTCATCCTGCGCTCCCACCCGCTC 1414 

I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I 
Db 901 GACAAGAGGTAC7SAGTCTGCAGCAATGGCCTCCAAGATTATCAGGCACTCCCACCCACTG 960 

Qy 1415 AGCCCCACACAGCGGCTGGTGGGCTGGATTGACCACGTCCTCCAGACAGGGGGCGCGACG 1474 

I III I Ml I I I I I I I II I I I I I II II II I I I I I I I I I I I I II I 

Db 961 ACCCCTGCCCAGAGGCTTTTGGGCTGGATAGATCATATCTTGCAGACAGGGGGTGCAGCA 1020 

Qy 1475 CACCTCAAGCCCTATGTCTTTCAGCAGCCCTGGCATGAGCAGTACCTGTTCGACGTTTTT 1534 

II I I I I I I I I I I I I II I II I I I .1 I I I I I I I I I I I I I I I I II I II II II 

Db 1021 CATCTCAAGCCATATGCTTTCCAGCAGCCATGGCATGAGCAGTACATGCTTGATGTCTTC 1080 

Qy 1535 GTGTTTCTGCTGGGGCTCACTCTGGGGACTCTATGGCTTTGTGGGAAGCTGCTGGGCATG 1594 

I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I M I I I I I I I I 
Db 1081 CTCTTTCTCCTAGGGCTCATGCTGGGTACTTTGTGGCTTAGTGTAAAGGTTCTTGTTGCT 1140 

Qy 1595 GCTGTCTGGTGGCTGCGTGGGGCCAGAAAGGTGAAGGAGACATAA 1639 

I I I I I I I I I I III I II M I I I I II I I I I I 

Db 1141 GTAACCAGGTATCTGAGTATAGCAACGAAGGTC7VAGGAGGCATAA 1185 



RESULT 8 
AK041045 

LOCUS 

DEFINITION 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE . 
JOURNAL 
MEDLINE 
PUBMED 

REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
PUBMED 
• REFERENCE 
AUTHORS 



Craniata; Vertebrata; Euteleostomi; 
Sciurognathi; Muridae; Murinae; Mus. 



AK041045 1353 bp mRNA linear HTC 03-APR-2004 

Mus musculus adult male aorta and vein cDNA^ RIKEN full-length 
enriched library, clone :A530069C13 product : hypothetical 
UDP-glucoronosyl and UDP-glucosyl transferase containing protein, 
full insert sequence. 
AK041045 

AK041045.1 GI: 26334156 
HTC; CAP trapper. 
Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Rodentia; 

1 

Carninci^P. and Hayashizaki, Y. 

High-efficiency full-length cDNA cloning 

Meth. Enzymol. 303, 19-44 (1999) 

99279253 

10349636 

2 

Carninci,?., Shibata,Y., Hayatsu,N., Sugahara,Y., Shibata,K., 

Itoh,M., Konno,H., Okazaki,Y., Muramatsu,M. and Hayashizaki, Y. 

Normalization and subtraction of cap-trapper-selected cDNAs to 

prepare full-length cDNA libraries for rapid discovery of new genes 

Genome Res. 10 (10), 1617-1630 (2000) 

20499374 

11042159 

3 

Shibata,K. , Itoh,M. , 
Konno , H • , Akiyama , J . 



Mzawa,K., Nagaoka,S., Sasaki, N., 
, Nishi,K., Kitsunai,T., Tashiro,H, 
Sumi,N., Ishii,Y., Nakamura,S., Ha2ama,M., Nishine,T., 



Carninci, P. 
, Itoh,M., 
Harada,A. , 



Yamamoto^R. , Matsumoto, H. , Sakaguchi, S . , Ikegami^T., Kashiwagi, K. ^ 
Fujiwake,S., Inoue^K., Togawa,Y., Izawa,M., Ohara,E., Watahiki,M., 
Yoneda^Y., Ishikawa,T., Ozawa,K., Tanaka^T., Matsuura,S.^ Kawai,J, 
Okazaki^Y., Muramatsu,M. ^ Inoue^Y., Kira^A. and Hayashizaki, Y. 

TITLE RIKEN integrated sequence analysis (RISA) system — 384-format 

sequencing pipeline with 384 multicapillary sequencer 

JOURNAL Genome Res. 10 (11), 1757-1771 (2000) 

MEDLINE 20530913 
PUBMED 11076861 
REFERENCE 4 

AUTHORS The RIKEN Genome Exploration Research Group Phase II Team and the 
FANTOM Consortium. 

TITLE Functional annotation of a full-length mouse cDNA collection . 

JOURNAL Nature 409, 685-690 (2001) 

REFERENCE 5 

AUTHORS The FANTOM Consortium and the- RIKEN Genome Exploration Research 
Group Phase I & II Team. 

TITLE Analysis of the mouse transcriptome based on functional annotation 
of 60,770 full-length cDNAs 

JOURNAL Nature 420, 563-573 (2002) 
REFERENCE 6 (bases 1 to 1353) 

AUTHORS Adachi,J., Aizawa,K., Akimura,T., Arakawa,T., Bono,H., Carninci,?. 
Fukuda,S., Furuno,M. , Hanagaki,T., Hara,A., Hashizume, W . , 
Hayashida, K. , Hayatsu,N,, Hiramoto,K., Hiraoka,T., Hirozane,T., 
Hori,F., Imotani,K-, Ishii,Y., Itoh,M., Kagawa,!., Kasukawa,T., 
Katoh,H., Kawai,J., Kojima,Y., Kondo,S., Konno,H., Kouda,M., 
Koya,S., Kurihara,C., Matsuyama, T . , Miyazaki,A., Murata,M., 
Nakamura,M., Nishi,K., Nomura, K., Numazaki,R., Ohno,M., Ohsato,N., 
Okazaki,Y., Saito,R., Saitoh, H., Sakai,C., Sakai,K., Sakazume,N., 
Sano,H., Sasaki, D., Shibata,K., Shinagawa, A. , Shiraki,T., 
Sogabe,Y., Tagami,M., Tagawa,A., Takahashi, F. , Takaku-Akahira, S . , 
Takeda,Y., Tanaka,T., Tomaru,A. , Toya,T., Yasunishi, A. , 
Muramatsu,M. and Hayashizaki, Y. 

TITLE Direct Submission 

JOURNAL Submitted ( 16- JUL-2001) Yoshihide Hayashizaki, The Institute of 
Physical and Chemical Research (RIKEN) , Laboratory for Genome 
Exploration Research Group, RIKEN Genomic Sciences Center (GSC), 
RIKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama 
Kanagawa 230-0045, Japan (E-mail : genome-res@gsc. riken. jp, 
URL :http: //genome. gsc.riken. jp/, Tel : 81-45-503-9222, 
Fax:81-45-503-9216) 
COMMENT cDNA library was prepared and sequenced in Mouse Genome 

Encyclopedia Project of Genome Exploration Research Group in Riken 
Genomic Sciences Center and Genome Science Laboratory in RIKEN. 
Division of Experimental Animal Research in Riken contributed to 
prepare mouse tissues. . 

Please visit our web site for further details. 
URL : http : / /genome . gs c . ri ken . j p/ 
URLrhttp: //f antom. gsc . riken. jp/ . 
FEATURES Location/Qualifiers 
source 1. .1353 

/organism="Mus mus cuius" 

/mol_type="mRNA" 

/strain="C57BL/6J" 

/db_xref="FANTOM_DB:A530069C13" 

/db_xref="taxon: 10090" 

/clone="A530069C13" 



/ sex="male" 

/tissue_type="aorta and vein" 

/clone_lib="RIKEN full-length enriched mouse cDNA library" 
/dev_stage="adult" 
CDS 83. .991 

/note="unnamed protein product; hypothetical 

UDP-glucoronosyl and UDP-glucosyl transferase containing 

protein (InterPro | IPR002213, evidence: InterPro) 

putative " 

/ codon_s ta r t=l 

/protein__id="BAC30796.1" 

/db_xref="GI: 26334157" 

/translation="MAAHRRWLLMSFLFLEVILLEA7yCILTISTLSASHYIVISRVSQ 
VLHEGGHNVTKLLYESANIPDFRKEKPSYQVINWRPPEDQEKKFADLRHRLTEEITYG 
RSKHHTLLKIHQYFGDLCSQLLSRKDIMDFLKNENFDLVLLDSMDLCSLLIVEKLGKR 
FVSFLPFQFSYMDFGLPSAPLSYAPVYGSGLTDQMDFWGRVKNFLMFLDFSMKQREIL 
SQYDSTIQEHFN^GSQPVLSDLLLKTVELWFVNSDFALDFARPLFPNTVYVGGYWTNLF 
SQYPKTWRILSLSLETQVLSLWPWAL" 

ORIGIN 

Query Match 31.5%; Score 731.6; DB 3; Length 1353; 

Best Local Similarity 73.1%; Pred. No. 5.6e-196; 

Matches 981; Conservative 0; Mismatches 354; Indels 7; Gaps 3; 
Qy 1 AGGGTCCCTTAGCCGGGCGCAGGGCGCGCAGCCCAGGCTGAGATCCGCGGCTTCCGTAGA 60 

I I I I I I I I i I I I I I M I I I I I I I I I I I I 

Db 16 AGGGTCCCTTAGTGGGGCACAGCTCTTTAGGTGTGCAACCTGTGTCAGGGCTCCACATAT 75 

Qy 61 AGTGAGCATGGCTGGGCAGCGAGTGCTTCTTCTAGTGGGCTTCCTTCTCCCTGGGGTCCT 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 76 TCTACAGATGGCAGCACATCGGCGTTGGCTTCTCATGAGCTTCCTTTTCCTTGAGGTTAT 135 

Qy 121 GCTCTCAGAGGCTGCCAA7\ATCCTGACAATATCTACAGTAGGTGGAAGCCATTATCTACT 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III II I I II II II II I 
Db 136 TCTCTTGGAGGCTGCAAAAATCCTGACTATATCTACACTGAGTGCAAGCCATTATATAGT 195 

Qy 181 GATGGACCGGGTTTCTCAGATTCTTCAAGATCACGGTCATAATGTCACCATGCTTAACCA 240 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I 

Db .196 GATAAGCCGTGTGTCGCAAGTTCTTCATGAAGGTGGCCACAATGTGACCAAACTTCTTTA 255 

Qy 241 CAAAAGAGGTCCTTTTATGCCAGATTTTAAAAAGGAAGAAAAATCATATCAAGTTATCAG 300 

III II II III I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I 
Db 256 TGAAA GTGCTAATATCCCAGATTTTAGAAAGGAAAAACCATCATATCAAGTTATTAA 312 

Qy 301 TTGGCTTGCACCTGAAGATCATCAAAGAGAATTTAAAAAGAGTTTTGATTTCTTTCTGGA 360 

I I I I I I I I I I I I I I I I I I I III I I I I I I II II II 

Db 313 TTGGCGTCCACCTGAAGATCAGGAAAAGAAATTTGCTGATCTTAGGCATCGACTTACAGA 372 

Qy 361 AGAAACTTTAGGTGGCAGAGGAAAATTTGAAAACTTATTAAATGTTCTAGAATACTTGGC 420 

I I I I I I I II I I III I I I I I I I II II I I I I I I I I 

Db 373 AGAAATAACTTATGGCAGGTCCAAACATCACACCCTTCTAAAGATCCATCAATACTTTGG 432 

Qy 421 GTTGCAGTGCAGTCATTTTTTAAATAGAAAGGATATCATGGATTCCTTAAAGAATGAGAA 480 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 433 GGACTTATGCAGTCAATTATTAAGCAGAAAGGACATCATGGACTTCTTAAAAAATGAGAA 492 



Qy 



481 CTTCGACATGGTGATAGTTGAAACTTTTGACTACTGTCCTTTCCTGATTGCTGAGAAGCT 540 



I I I I I I I I I I MM I I M M M M M M M M M I M M I 
493 CTTTGACCTAGTACTTCTTGATTCAATGGATCTCTGTTCTTTGCTAATTGTTGAAAAGCT 552 



541 TGGGAAGCCATTTGTGGCCATTCTTTCCACTTCATTCGGCTCTTTGGAATTTGGGCTACC 600 

M M M I M M M I M M M I M I Ml M I I M M M M M M M 
553 TGGGAAACGATTTGTGTCCTTTCTTCCCTTTCAATTTAGCTATATGGACTTTGGGTTACC 612 

601 AA TCCCCTTGTCTTATGTTCCAGTATTCCGTTCCTTGCTGACTGATCACATGGACTT 657 

M M M M M I M M M M M I MM M M M I M M M M M 
613 AAGCGCCCCCTTGTCCTATGCTCCAGTGTATGGTTCTGGTCTAACTGACCAAATGGACTT 672 

658 CTGGGGCCGAGTGAAGAATTTTCTGATGTTCTTTAGTTTCTGCAGGAGGCAACAGCACAT 717 

M M M M M M M M M M M M M M M M M M I M M MM I I M 
673 CTGGGGCCGAGTGAAGAACTTTCTGATGTTCCTTGATTTCTCCATGAAGCAAAGGGAAAT 732 

718 GCAGTCTACATTTGACAACACCATCAAGGAACATTTCACAGAAGGCTCTAGGCCAGTTTT 777 

I Ml I M M I M I M I M M M M I M M M M I M M M I I 

733 CCTTTCTCAGTATGACAGCACTATCCAGGAGCATTTTGTGGAAGGCTCTCAGCCAGTGTT 792 
778 GTCTCATCTTCTACTG7VAAGCAGAGTTGTGGTTCATTAACTCTGACTTTGCCTTTGATTT 837 

MM I M M M M M M M Ml I II I I I I I II II II II M II I I II II I II 
793 GTCTGACCTTCTACTGAAAGCTGAGCTGTGGTTTGTCAACTCTGACTTTGCTTTGGATTT 852 

838 TGCTCGACCTCTGCTTCCCAACACTGTTTATGTTGGAGGCTTGATGGAAAAACCTATTAA 897 

II II II II Ml II M M II II II II II I II II I II I II I II II II II I 
853 TGCTCGTCCCCTGTTTCCCAACACAGTCTATGTGGGAGG-TTACTGGACAAACCTGTTCA 911 

898 ACCAGTACCACAAGACTTGGAGAACTTCATTGCCAAGTTTGGGGACTCTGGTTTTGTCCT 957 

III I II I II I II II II II II I II II I I II II II II II I II II II II I II 
912 GCCAATACCCCAAGACTTGGAGAATTTTATCTCTCAGTTTGGAGACTCAGGTTTTGTCCT 971 

958 TGTGACCTTGGGCTCCATGGTGAACACCTGTCAGAATCCGGAAATCTTCAAGGAGATGAA 1017 

II II II I II II II II II II M MM II M I I I M M I I II II I 

972 TGTGGCCCTGGGCTCTATAGTGAGCATGATTCAGTCCAAGGA7\ATTATTAAGGAGATGAA 1031 

1018 CAATGCCTTTGCTCACCTACCCCAAGGGGTGATATGGAAGTGTCAGTGTTCTCATTGGCC 1077 

II II I 11 II I II II I II II II I I II II I II II I I III II 11 I I I I I II I M 
1032 CAGTGCCTTTGCTCACCTCCCTCAAGGGGTGCTATGGACATGTAAGACTTCTCATTGGCC 1091 

1078 CAAAGATGTCCACCTGGCTGCAAATGTGAAAATTGTGGACTGGCTTCCTCAGAGTGACCT 1137 

II II M II M II II II II II I II II I II II II M II I I MM I i II II 
1092 CAAAGATGTCAGTTTGGCCCCAAATGTCAAAATCATGGATTGGCTTCCACAGACTGACCT 1151 

1138 CCTGGCTCACCCAAGCATCCGTCTGTTTGTCACCCACGGCGGGCAGAATAGCATAATGGA 1197 

M II II M hi Mill M II II I I II II M II II III Ml II I II I II 

1152 TCTAGCTCACCCTAGCATTCGTCTGTTTGTCACTCATGGAGGGATGAACAGTGTGATGGA 1211 

1198 GGCCATCCAGCATGGTGTGCCCATGGTGGGGATCCCTCTCTTTGGAGACCAGCCTGAAAA 1257 

III II II II II I II II II II M II II II II I II I II II I II II II I I 
1212 GGCTGTCCATCATGGAGTACCCATGGTGGGGATTCCATTTTTTTTTGACCAACCTGAAAA 1271 

1258 CATGGTCCGAGTAGAAGCCAAAAAGTTTGGTGTTTCTATTCAGTTAAAGAAGCTCAAGGC 1317 

II II II II I II I M II I I II II I II I M II II I II I I II II III II II II II I 
1272 CATGGTCCGAGTAGAAGCAAAGAACCTTGGTGTTTCTATTCAGCTACAGACGCTCAAGGC 1331 

1318 AGAGACATTGGCTCTTAAGATG 1339 
II II II II II II I III 



Db 



1332 AGAGTCATTTGCGCTCACCATG 1353 



RESULT 9 
BI102968 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



Craniata; Vertebrata; Euteleostomi; 
Sciurognathi; Muridae; Murinae; Mus. 



BI102968 978 bp mRNA linear EST 26-JUN-2001 

602888578F1 NCI_CGAP_Kidl4 Mus musculus cDNA clone IMAGE: 5043978 
5', mRNA sequence. 
BI102968 

BI102968 .1 GI : 14553861 
EST. 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Rodentia; 
1 (bases 1 to 978) 
NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Healthy Mammalian Gene Collection (MGC) 
Unpublished (1999) 
Contact: Robert Strausberg, Ph.D. 
Email: cgapbs-r@mail.nih.gov 
Tissue Procurement: Jeffrey E. Green, M.D. 

cDNA Library Preparation: Life Technologies, Inc. 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Incyte Genomics, Inc. 

Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consortium/ LLNL at: 
http://image.llnl.gov 
Plate: LLAM11120 row: k column: 19 
High quality sequence stop: 812. 

Location/ Qualifiers 

1. .978 

/organism="Mus musculus" 
/mol_type="mRNA" 
/strain="FVB/N" 
/db_xref="taxon: 10090" 
/ clone= " IMAGE :5043978" 

/lab_host="DH10B (Tl phage-resistant) " 
/clone_lib="NCI_CGAP_Kidl4 " 

/note="Organ: kidney; Vector: pCMV-SP0RT6; Site_l: NotI; 
Site_2: Sail; Cloned unidirectionally . Primer: Oligo dT 
Average insert size 1.75 kb. Constructed by Life 
Technologies. Note: this is a NCI_CGAP Library. |" 



ORIGIN 



Query Match 22.5%; 

Best Local Similarity 77.7%; 
Matches 682; Conservative 



Score 5 2 2... 4; DB 4; 
Pred. No. 1.6e-136; 
0; Mismatches 191; 



Length 978; 



Indels 



5 ; Gaps 



4; 



Qy 

Db 



644 GATCACATGGACTTCTGGGGCCGAGTGAAGAATTTTCTGATGTTCTTTAGTTTCTGCAGG 703 
II II I I I I I I I I II I I I I I I I II II I I I I I I I I I I I I I II I I I MM II I 
1 GACCAAATGGACTTTTGGGGCCGAGTGAAGAACATTCTGATGTTCTTTCATTTCACCAAG 60 



Qy 

Db 



704 AGGCAACAGCACATGCAGTCTACATTTGACAACACCATCAAGGAACATTTCACAG7VAGGC 763 
I I I I I I II I III M I I II I I I I I I I I I I I I I I I I I I II I 
61 AAGCGAAGGGACATCTTTTCTCAATATGGCAATACTGTCCAGGAGCATTTTGCGGAAGGC 120 



Qy 764 TCTAGGCCAGTTTTGTCTCATCTTCTACTGAAAGCAGAGTTGTGGTTCATTAACTCTGAC 823 

III I I I I I I I 11 I I I I I I I I I I II I I I I II III I I I I I I I I I I I I I I I I I . 
Db 121 TCTCAGCCAGTGTTGTCTGACCTTCTACTGAAAGCTGAGCTGTGGTTTGTCAACTCTGAC 180 

Qy 824 TTTGCCTTTGATTTTGCTCGACCTCTGCTTCCCAACACTGTTTATGTTGGAGGCTTGATG 883 

I I I II II I II I I I I I I I II II III I I I I I I II I I II I I I I I I I I I I II II 
Db 181 TTTGCCTTGGATTTTGCTCGTCCCCTGTTTCCCAACACAGTCTATGTGGGAGGTTTACTG 240 

Qy 884 GAAAAACCTATTAAACCAGTACCACAAGACTTGGAGAACTTCATTGCCAAGTTTGGGGAC 943 

I I I I II II II I I I I II I I I I I II I I I I I I I I I I I I I I I 11 I I I I I I 
Db 241 GACAAACCTGTTCAGCC7\ATACCCCAAGACTTGGAGGATTTTATCTCTCAGTTTGGAGAC 300 

Qy 944 TCTGGTTTTGTCCTTGTGACCTTGGGCTCCATGGTGAACACCTGTCAGAATCCGGAAATC 1003 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 

Db 301 TCAGGTTTTGTCCTTGTGGCCCTGGACTCTGTAGTGAGCATGATTCAGTCCAAGGAAATT 360 

Qy 1004 TTCAAGGAGATGAACAATGCCTTTGCTCACCTACCCCAAGGGGTGATATGGAAGTGTCAG 1063 

I I I II I I I I II I I I I I I I M I I I I I I I I I II I I I I I I I I I I I I I I I III II 
Db 361 ATTAAGGAGATGAACAGTGCCTTTGCTCACCTCCCTCAAGGGGTGCTATGGACATGTAAG 420 

Qy 1064 TGTTCTCATTGGCCCAAAGATGTCCACCTGGCTGCAAATGTGAAAATTGTGGACTGGCTT 1123 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 421 AGTTCTCATTGGCCCAAAGATGTCAGTTTGGCCCCAAATGTCAAAATCATGGATTGGCTT 480 

Qy 1124 CCTCAGAGTGACCTCCTGGCTCACCCAAGCATCCGTCTGTTTGTCACCCACGGCGGGCAG 1183 

II I I I I I I I I I I II I I I I I I I I Mill I I I I I I I I I I I I I I I I I II III I 

Db 481 CCACAGATTGACCTTCTAGCTCACCCTAGCATTCGTCTGTTTGTCACCCATGGGGGGATG 540 

Qy 1184 AATAGCATAATGGAGGCCATCCAGCATGGTGTGCCCATGGTGGGGATCCCTCTCTTTGGA 1243 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 541 AACAGTGTAATGGAGGCTGTCCATCATGGAGTACCAATGGTAGGGATTCCATTTTTTGGA 600 

Qy 1244 GACCAGCCTGAAAACATGGTCCGAGTAGAAGCCAAAAAGTTTGGTGTTTCTATTCAGTTA 1303 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I M I I I I I I I I I I I I II 
Db 601 GACCAACCTGAAAACATGGTCCGAGTAGAAGCAAAGAACCTTGGTGTTTCTATTCAGCTA 660 

Qy 1304 AAGAAGCTCAAGGCAGAGACATTGGCTCTTAAGATGAAACAAATCATGGAAGACAAGAGA 1363 

III I I I I M I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I 

Db 661 CAGACACTCAAGGCAGAGTCATTTTGGCTCACCATGAAAGAAGTCATAGAAGACCAGAGG 720 

Qy 1364 TACAAGTCCGCGGCAGTGGCTGCCAGTGTCATCCTGCGCTCCCACCCGCTCAGCCCCACA 1423 

I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I III 
Db 721 TACAAGACTGCAGCA-TGGCCTCCAAGGTTATCAGGAACTCCCACC—AATGACCCTGGC 777 

Qy 1424 CAGCGGCTGGTGGGCTGGATTGACCACGTC-CTCCAGACAGGGGGCGCGACGCACCTCAA 1482 

^^.U I I I I I I I I I I I I I I I I I I I Ml II I I I I I I I I I I I I II ^k4l I I I I I 
Db 778 CAGAGGCTTGTGGGCTGGATTGATCACATCTTTGCAGACAGGGGGTGCAGCCCATCTCAA 837 

Qy 1483 GCCCTATGTCTT-TCAGCAGCCCTGGCATGAGCAGTAC 1519 

I I I I I I M I I I I I I I I I I I I I I I I I I I 
Db 838 AGCATATGGCTTCCCAGCAGGCTTGGCATAACGAGTAC 875 



RESULT 10 
CN396939 

LOCUS CN396939 510 bp mRNA linear EST 16-MAY-2004 

DEFINITION 17000599940872 GRN_PRENEU Homo sapiens cDNA 5', mRNA sequence. 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 

JOURNAL 
COMMENT 



FEATURES 

source 



CN396939 

CN396939.1 GI:47384534 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 510) 

Brandenberger, R. , Wei^H., Zhang^S., Lei,S.^ Murage, J. ^ Fisk,G.J., 
Li^Y., Xu,C.^ Fang,R., Guegler^K., Rao, M.S., Mandalam, R., 
Lebkowski,J and Stanton, L.W. 

Transcriptome characterization elucidates signaling networks that 

control hioman ES cell growth and differentiation 

Nat. Biotechnol. 22 (6), 707-716 (2004) 

Contact: Brandenberger R 

Regenerative Medicine 

Geron Corporation 

230 Constitution Drive, Menlo Park, CA 94025, USA 

Tel: 650 473 8658 

Fax: 650 473 7760 

Email: rbrandenberger@geron.com 

Insert Length: 510 Std Error: 0.00. 

Location/ Qualifiers 

1. .510 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref ="taxon : 9606" 

/tissue_type="embryonic stem cell, retinoic acid and 
mitogen-treated hES cell line H7" 
/ clone_lib=" GRN__PRENEU " 

/note="oligo dT primed, full-length enriched cDNA library 
from hES cell line H7 (p29) maintained in feeder-free 
conditions, Embryoid bodies were generated in the presence 
of all-trans retinoic and mitogens . " 



ORIGIN 



Query Match 21.8%; 
Best Local Similarity 99.8%; 
Matches 507; Conservative 



Score 506.4; DB 7 
Pred. No. 4.5e-132 
0; Mismatches 1 



Length 510; 

Indels 0 ; Gaps 0 ; 



Qy 1109 ATTGTGGACTGGCTTCCTCAGAGTGACCTCCTGGCTCACCCAAGCATCCGTCTGTTTGTC 1168 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3 ATTGTGGACTGGCTTCCTCAGAGTGACCTCCTGGCTCACCCAAGCATCCGTCTGTTTGTC 62 



Qy 

Db 

Qy 

Db 

Qy 

Db 



1169 ACCCACGGCGGGCAGAATAGCATAATGGAGGCCATCCAGCATGGTGTGCCCATGGTGGGG 1228 
I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

63 ACCCACGGCGGGCAGAATAGCATAATGGAGGCCATCCAGCATGGTGTGCCCATGGTGGGG 122 

1229 ATCCCTCTCTTTGGAGACCAGCCTGAAAACATGGTCCGAGTAGAAGCCAAAAAGTTTGGT 1288 

I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
123 ATCCCTCTCTTTGGAGACCAGCCTGAAAACATGGTCCGAGTAGAAGCCAAAAAGTTTGGT 182 

1289 GTTTCTATTCAGTTAAAGAAGCTCAAGGCAGAGACATTGGCTCTTAAGATGAAACAAATC 1348 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
183 GTTTCTATTCAGTTAAAGAAGCTCAAGGCAGAGACATTGGCTCTTAAGATGAAACAAATC 242 



Qy 



1349 ATGGAAGACAAGAGATACAAGTCCGCGGCAGTGGCTGCCAGTGTCATCCTGCGCTCCCAC 1408 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 243 ATGGAAGACAAGAGATACAAGTGCGCGGCAGTGGCTGCCAGTGTCATCCTGCGCTCCCAC -302 



Qy 1409 CCGCTCAGCCCCACACAGCGGCTGGTGGGCTGGATTGACCACGTCCTCCAGACAGGGGGC 1468 

I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I 
Db 303 CCGCTCAGCCCCACACAGCGGCTGGTGGGCTGGATTGACCACGTCCTCCAGACAGGGGGC 362 

Qy 1469 GCGACGCACCTCAAGCCCTATGTCTTTCAGCAGCCCTGGCATGAGCAGTACCTGTTCGAC 1528 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 363 GCGACGCACCTCAAGCCCTATGTCTTTCAGCAGCCCTGGCATGAGCAGTACCTGCTCGAC 422 

Qy 1529 GTTTTTGTGTTTCTGCTGGGGCTCACTCTGGGGACTCTATGGCTTTGTGGGAAGCTGCTG 1588 

I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I 1. 1 I I I 
Db 423 GTTTTTGTGTTTCTGCTGGGGCTCACTCTGGGGACTCTATGGCTTTGTGGGAAGCTGCTG 482 

Qy 1589 GGCATGGCTGTCTGGTGGCTGCGTGGGG 1616 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 483 GGCATGGCTGTCTGGTGGCTGCGTGGGG 510 



RESULT 11 
CR769328 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 

JOURNAL 
COMMENT 



CR769328 717 bp mRNA linear EST 23-SEP-2004 

DKFZp469O0528_rl 469 (synonym: pkidl) Pongo pygmaeus cDNA clone 
DKFZp469O0528 5', mRNA sequence. 
CR769328 

CR769328.1 GI: 52613343 
EST. 

Pongo pygmaeus (orangutan) 
Pongo pygmaeus 

Eukaryota; Metazoa; Chordata; 



FEATURES 

source 



Craniata; Vertebrata; Euteleostomi; 



Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Pongo. 
1 (bases 1 to 717) 

Ottenwaelder, B. ^ Obermaier, B. , Deutschenbaur, S . ^ Schaipp^-A., 
Mewes>H.W., Weil,B., Amid,C., Osangeri^A. , Fobo,G-, Han,M. and 
Wiemann, S . 

Pongo pygmaeus mRNA (Ottenwaelder, B. , Obermaier^ B. , 
Deutschenbaur^S. , et al.) 

Unpublished (2004) 
Contact: MIPS 
MIPS 

Ingolstaedter Landstr.l, D-85764 Neuherberg, Germany 

This is the 5' sequence of the clone insert. Clone from S. Niemann^ 

Molecular Genome Analysis^ German Cancer Research Center (DKFZ); 

Email s.wiemann@dkfz-heidelberg.de; sequenced by Medigenomix 

(Martinsried/Germany) within the cDNA- sequencing consortium of the 

German Genome Project. This clone (DKFZp4 6900528 ) is available at 

the RZPD Deutsches Ressourcenzentrum fuer Genomf orschung GmbH in 

Berlin, Germany. Please contact RZPD for ordering: 

http: //www. rzpd.de/cgi-bin/products/cl . cgi?CloneID=DKFZp4 6900528 

Further information about the clone and the sequencing project is 

available at http://mips.gsf.de/projects/cdna/. 

Location/Qualifiers 

1. .717 

/organism="Pongo pygmaeus" 

/mol_type= "mRNA" 

/db xref="taxon:9600" 



/clone="DKFZp469O0528" 
/tissue_type="kidney" 
/dev_stage="adult" 
/lab_host="DH10B" 

/clone_lib="469 (synonym: pkidl)" 

/note="Vector: pSportl_Sfi; Site_l: SfilA; Site_2: SfilB" 

ORIGIN 

Query Match 21.1%; Score 490.6; DB 7; Length 717; 

Best Local Similarity 83.1%; Pred. No. 1.6e-127; 

Matches 593; Conservative 0; Mismatches 117; Indels 4; Gaps 3; 



Qy 43 ATCCGCGGCTTCCGTAGAAGTGAGCATGGCTGGGCAGCGAGTGCTTCTTCTAGTGGGCTT 102 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 
Db 4 ATCCGCGGCTTTTGTGGAAGTGAGCATGGCTGGGCAGCGGGTGCTGCTTCTAGTGGCCTT 63 

Qy 103 CCTTCTCCCTGGGGTCCTGCTCTCAGAGGCTGCCAAAATCCTGACAATATCTACAGTAGG 162 

I I I I I I I II I II I I I I I II I I I II II I I I I I I I II I I II I II I I I I II I II I I I II 
Db 64 CCTTCTTTCTGGGGTCCTGCTCTCAGAGGCTGCCAAAATCCTGACAATATCTACACTGGG 123 

Qy 163 TGGAAGCCATTATCTACTGATGGACCGGGTTTCTCAGATTCTTCAAGATCACGGTCATAA 222 

I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I II I I I II I I I 
Db 124 TGGAAGCCATTACCTACTGTTGGACCGGGTGTCTCAGATTCTTCAAGAGCATGGTCATAA 183 

Qy 223 TGTCACCATGCTTAACCACAAAAGAGGTCCTTTTATGCCAGATTTTAAAAAGGAAGAAAA 282 

I M I I I I I I I I I II I II III II I I I I I I I I I I I I I I I I I I I I 

Db 184 TGTGACTATGCTTCATCAGAGTGGAAAGTTTTTAATCCCAGATATTAAAGAGGAGGAAAA 243 

Qy 283 ATCATATCAAGTTATCAGTTGGCTTGCACCTGAAGATCATCAAAGAGAATTTAAAAAGAG 342 

I I I I I I I I II I I I I II I III II I I I I I I I I I I I I I I I I I I I I I I I III 
Db 244 ATCATACCAAGTTATCAGGTGGTTTTCACCTGAAGATTATCAAAAAAGAATTAAGAAGCA 303 

Qy 343 TTTTGATTTCTTTCTGGAAGAAACTTTAGGTGGCAGAGGAAAATTTGAAAACTTATTAAA 402 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I 
Db 304 TTTCGATAGTTACATAGAAACAGCATTGGATGGCAGAAAAGAATTTGAAGTCTTTGTAAA 363 

Qy 403 TGTTCTAGAATACTTGGCGTTGCAGTGCAGTCATTTTTTAAATAGAAAGGATATCATGGA 462 

I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I 
Db 364 GCTAATGGAAATATTTGGGACTCAATGTAGTTATTTGCTAAGCAGAAAGGATATAATGGA 423 

Qy 463 TTCCTTAAAGAATGAGAACTTCGACATGGTGATAGTTGAAACTTTTGACTACTGTCCTTT 522 

I I I I I I I I I I I I I I I I I I I I II MM I II II I I I II I II I I I II II I I 
Db 424 TTCCTTAAAGAATGAAAACTTTGATCTGGTATTTGTTGAAGCATTTGATTTCTGTTCTTT 483 

Qy 523 CCTGATTGCTGAGAAGCTTGGGAAGCCATTTGTGGCCATTCTTTCCACTTCATTCGGCTC 582 

Mill Ml I II I M I 11 II I III II I I I II I I I I II II I I I I II I I I I I II U \ I 
Db 484 CCTGATTGCTGAGAAGCTTGTGAAACCATTTGTGGCCATTCTTCCCACCACATTCGGCTC 543 

Qy 583 TTTGGAATTTGGGCTACCAATCCCCTTGTCTTATGTTCCAGTATTCCGTTCCTTGCTGAC 642 

I II I II II I M I I M II I I II I II II M I M I II I I II M II I II II I II I I II I I I 

Db 544 TTTGGATTTTGGGCTACCAAGCCCCTTGTCTTATGTTCCAGTATTCCATTCCTTGCTGAC 603 

Qy 643 TGATCACATGGACTTCT-GGGGCCGAGTGAAGAATTTTCTGATGTTCTTTAGTTTCTGCA 701 

II II I II II II M II II I II I II I II I II I I II II II M I I II I II It II II I I I II 

Db 604 TGATCACATGGACTTCTGGGGGCCGAGTGAAGAATTTTCTGATGNTCTTTAGTTTCTCCA 663 

Qy 702 GGAGGC—AACAGCACATGCAGTCTACATTT-GACAACACCATCAAGGAACATT 752 



Db 



I I I I I I I I I I I I M I I I M I I I I I I I I M i II I I I I I I I I I I I 
664 GGAGCCCNATGGGGACATGCAGTCTACATTTGGACAACNCCNTCAAGGAGCATT 717 



RESULT 12 
BI559553 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



BI559553 761 bp mRNA linear EST 05-SEP-2001 

603252894F1 NIH_MGC_97 Homo sapiens cDNA clone IMAGE: 5295502 5*, 
mRNA sequence. 
BI559553 

BI559553.1 GI:15446867 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 761) 

NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 

Unpublished (1999) 

Contact: Robert Strausberg, Ph.D. 

Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: Miklos Palkovits, M.D., Ph.D. 

cDNA Library Preparation: Michael J. Brownstein (NHGRI), Shiraki 
Toshiyuki and Piero Carninci (RIKEN) 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)- 

DNA Sequencing by: Incyte Genomics, Inc. 

Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consortium/ LLNL at: 
http : / /image . llnl . gov 
Plate: LLAM11747 row: k column: 23 
High quality sequence stop: 726. 

Location/Qualifiers 

1. .761 

/organism="Homo sapiens" 

/mol_type="mRNA" 

/db_xref="taxon:9606" 

/clone="IMAGE: 5295502" 

/lab_host="DH10B" 

/clone_lib="NIH_MGC_97" 

/note="Organ: testis; Vector: pBluescriptR (modified 
pBluescript KS+) ; Site_l: BamHI; Site_2: Sall-Xhol 
(gtcgag) ; Oligo-dT primed using primer 
5 I _TTTTTTTTTTTTTTTTVN-3 • , size-selected for average 
insert size 2.2 kb and normalized to ROT 5. This is a 
primary library enriched for full-length clones and 
constructed using the Cap-trapper method (Carninci, in 
preparation) . Library constructed by M. Brownstein 
(NIMH/NHGRI, National Institutes of Health). Note: this is 
a NIH_MGC Library." 



ORIGIN 



Query Match 21.1%; Score 488.6; DB 4; Length 761; 

Best Local Similarity 80.7%; Pred. No. 6.1e-127; 

Matches 608; Conservative 0; Mismatches 139; Indels 6; Gaps 3; 
Qy 8 CTTAGCCGGGCGCAGGGCGCGCAGCCCAGGCTGAGATCCGCGGCTTCCGTAGAAGTGAGC 67 



Db 



I I I M I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I M I I I M II I I II II II I 
6 CTTAGCCCGGCGCAGGGCGCGCAGCCCAGGCTGAGATCCGCTGCTTCTGTGGAAGTGAGC 65 



Qy 68 ATGGCTGGGCAGCGAGTGCTTCTTCTAGTGGGCTTCCTTCTCCCTGGGGTCCTGCTCTCA 127 

I I I I I I I I I I I II I I I I I I I I I I II M I I II I I 1 I I I I I I I II I I I I I I I I M I 
Db 66 ATGGTTGGGCAGCGGGTGCTGCTTCTAGTGGCCTTCCTTCTTTCTGGGGTCCTGCTCTCA 125 

Qy 128 GAGGCTGCCAAAATCCTGACAATATCTACAGTAGGTGGAAGCCATTATCTACTGATGGAC 187 

I I I I II I I I I I I I I I II I I I I I I I I I I I II I I I I I M I I I I I I I I II I I I I I I I I I 
Db 126 GAGGCTGCCAAAATCCTGACAATATCTACACTGGGTGGAAGCCATTACCTACTGTTGGAC 185 

Qy 188 CGGGTTTCTCAGATTCTTCAAGATCACGGTCATAATGTCACCATGCTTAACCACAAAAGA 247 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I II I II 
Db 186 CGGGTGTCTCAGATTCTTCAAGAGCATGGTCATAATGTGACTATGCTTCATCAGAGTGGA 245 

Qy 248 GGTCCTTTTATGCCAGATTTTAAAAAGGAAGAAAAATCATATCAAGTTATCAGTTGGCTT 307 

II II I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I III II 
Db 246 AAGtTTTGGATCCCAGATATTAAAGAGGAGGAAAAATCATACCAAGTTATCAGGTGGTTT 305 

Qy 308 GCACCTGAAGATCATCAAAGAGAATTTAAAAAGAGTTTTGATTTCTTTCTGGAAGAAACT 367 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II i I I I II 
Db 306 TCACCTGAAGATCATCAAAAAAGAATTAAGAAGCATTTTGATAGCTACATAGAAACAGCA 365 

Qy 368 TTAGGTGGCAGAGGAAAATTTGAAAACTTATTAAATGTTCTAGAATACTTGGCGTTGCAG 427 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II 

Db 366 TTGGATGGCAGAAAAGAATCTGAAGCCCTTGTAAAGCTAATGGAAATATTTGGGACTCAA 425 

Qy 428 TGCAGTCATTTTTTAAATAGAAAGGATATCATGGATTCCTTAAAGAATGAGAACTTCGAC 487 

II I I I J I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II 

Db 426 TGTAGTTATTTGCTAAGCAGAAAGGATATAATGGATTCCTTAAAGAATGAGAACTATGAT 485 

Qy 488 ATGGTGATAGTTGAA-ACTTTTGACTACTGTCCTTTCCTGATTGCTGAGAAGCTTGGGAA 546 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 
Db 486 CTGGTATTTGTTGAACGCATTTGATTTCTGTTCTTTCCTGATTGCTGAGAAGCTTGTGAA 545 

Qy 547 GCCATTTGTGGCCATTCTTTCCACTTCATTCGGCTCTTTGGAATTTGGGCTACCAATCCC 606 

I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I III 
Db 546 ACCATTTGTGGCCATTCTTCCCACCACATTCGGCTCTATGGATTTGGGGCTACCAAGCCC 605 

Qy 607 CTTGTCTTATGTTCCAGTATTCCGTTCCTTGCTGACT-GATCACATGGACTTCTGGGGCC 665 

I I I I I I I I I I I I I I I I I I I I I I I ' I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db- 606 CTTGTCTTATGTTCCAGTATTCCCTTCCTTGCTGACTGGATCACATGGACTTCTGGGGCC 665 

Qy 666 GAGTGAAGAATTTTCTGATGTTCTTTAGTTTCTGCAGGAGG CAACAGCACATGCAG 721 

I I I I I I I I I I I I I I I I I I I I II Ml III I M I I I I I I 

Db ..w.. 666 GAGTGAAGAAATTTCTTGATGTCTTTTAGTTTCCTCCAAGGAGCCCAATGGGACATGCAG 725 

Qy 722 TCTACATTTGACAACACCATCAAGGAACATTTC 754 

I I I I I I II I I I I I I I I I I I I I I I I 

Db 726 TCTACAATTTGACAACACATCAAGGAGCATTTC 758 

RESULT 13 
AW173071/C 

LOCUS AW173071 694 bp mRNA linear EST 16-NOV-1999 



DEFINITION xj82gll.xl Soares_NFL_T_GBC_Sl Homo sapiens cDNA clone 
IMAGE:2663780 3' similar to SW:CGT RAT Q09426 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
COMMENT 



FEATURES 

source 



Craniata; Vertebrata; Euteleostomi; 
Catarrhini; Hominidae; Homo. 



2-HYDROXYACYLSPHINGOSINE 1-BETA-GALACTOSYLTRANSFERASE PRECURSOR 

itiRNA sequence. 

AW173071 

AW173071.1 GI:6439019 
EST. 

Homo sapiens (human) 

Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 694) 

NCI-CGAP http : //www. ncbi . nlm. nih . gov/ncicgap . 

National Cancer Institute, Cancer Genome Anatomy Project (CGAP) , 

Tumor Gene Index 

Unpublished (1997) 

Contact: Robert Strausberg, Ph.D. 

Email: cgapbs-r@mail.nih.gov 

This clone is available royalty-free through LLNL ; contact the 
IMAGE Consortium (info@image.llnl.gov) for further information. 
Seq primer: -40UP from Gibco 
High quality sequence stop: 412. 

Location/Qualifiers 

1. .694 

/organism="Homo sapiens" 
/mol_t ype= "mRNA" 
/db_xref="taxon: 9606" 
/ cl one= " IMAGE :2663780" 
/lab_host="DH10B" 

/clone_lib="Soares_NFL_T_GBC_Sl" 

/note="Organ: pooled; Vector: pT7T3D-Pac (Pharmacia) with 
a modified polylinker; Site_l: Not I; Site_2: Eco RI; ' 
Equal amounts of plasmid DNA from three normalized 
libraries (fetal lung NbHL19W, testis NHT, and B-cell 
NCI_CGAP_GCB1) were mixed, and ss circles were made in 
vitro. Following HAP purification, this DNA was used as 
tracer in a subtractive hybridization reaction. The driver 
was PCR-amplified cDNAs from pools of 5,000 clones made 
from the same 3 libraries. The pools consisted of 
I.M.A.G.E. clones 297480-302087, 682632-687239, 
726408-728711, and 729096-731399. Subtraction by Bento 
Scares and M, Fatima Bonaldo. " 



ORIGIN 



Query Match 20.0%; 
Best Local Similarity 83.5%; 
Matches 537; Conservative 



Score 463.6; DB 2; 
Pred. No- .7.7e-120; 
0; Mismatches 105; 



Length 694; 
Indels 1; 



Gaps 



1; 



Qy 



Db 



268 TAAAAAGGAAGAAAAATCATATCAAGTTATCAGTTGGCTTGCACCTGAAGATCATCAAAG 327 

I I I I I I I III II I I I II I I III I I I I I I I I I II I I 

643 TATAAAGAGAAGCAAAATTCATACAGGTATCCGGGTGGTTTTCACCTAAGATCATCAAAA 584 



Qy 

Db 



328 AGAATTTAAAAAGAGTTTTGATTTCTTTCTGGAAGAAACTTTAGGTGGCAGAGGAAAATT 387 

I I I I I I I I I II II I I M I M I I I I I I I I I II I I II I I I I I 
583 AGAA-TTAAGAAGCATTTTGATAGCTACATAGAAACAGCATTGGATGGCAGAAAAGAATC 525 



Qy 

Db 



388 
524 



TGAAAACTTATTAAATGTTCTAGAATACTTGGCGTTGCAGTGCAGTCATTTTTTAAATAG 447 

11 I I II III I I I I I I I I I I I I I I I I I I I I III II 
TGAAGCCCTTGTANAGCTAATGGAAATATTTGGGACTCAATGTAGTTATTTGCTAAGCAG 465 



Qy 

Db 



448 
464 



AAAGGATATCATGGATTCCTTAAAGAATGAGAACTTCGACATGGTGATAGTTGAAACTTT 

i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II 
AAAGGATATAATGGATTCCTTAAAGAATGAGAACTATGATCTGGTATTTGTTGAAGCATT 



507 
405 



Qy 508 TGACTACTGTCCTTTCCTGATTGCTGAGAAGCTTGGGAAGCCATTTGTGGCCATTCTTTC 567 

III I I I I I II I I I II I I I I II I II I I II I I I I III I I I I I I I I I I I M I I I I I I 

Db 404 TGATTTCTGTTCTTTCCTGATTGCTGAGAAGCTTGTGAAACCATTTGTGGCCATTCTTCC 345 

Qy 568 CACTTCATTCGGCTCTTTGGAATTTGGGCTACCAATCCCCTTGTCTTATGTTCCAGTATT 627 

III I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I 
Db 344 CACCACATTCGGCTCTTTGGATTTTGGGCTACCAAGCCCCTTGTCTTATGTTCCAGTATT 285 

Qy 628 CCGTTCCTTGCTGACTGATCACATGGACTTCTGGGGCCGAGTGAAGAATTTTCTGATGTT 687 

II II II I I I I I I I I I 11 I I I II I I I I I I I I I I I I I I M I I I II I II I I I I II II I I I II 

Db 284 CCCTTCCTTGCTGACTGATCACATGGACTTCTGGGGCCGAGTGAAGAATTTTCTGATGTT 225 

Qy 688 CTTTAGTTTCTGCAGGAGGCAACAGCACATGCAGTCTACATTTGACAACACCATCAAGGA 747 

I I I I I I I I I I I I M I I I III I I I I i I I I I I I I I I I I M I I I I I II I I I I I I I I I I 
Db 224 CTTTAGTTTCTCCAGGAGCCAATGGGACATGCAGTCTACATTTGACAACACCATCAAGGA 165 

Qy -748 ACATTTCACAGAAGGCTCTAGGCCAGTTTTGTCTCATCTTCTACTGAAAGCAGAGTTGTG 807 

MINI I I II I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I 

Db 164 GCATTTCCCAGAAGGCTCTAGGCCAGTTTTGTCTCATCTTCTACTGAAAGCAGAGTTGTG 105 

Qy 808 GTTCATTAACTCTGACTTTGCCTTTGATTTTGCTCGACCTCTGCTTCCCAACACTGTTTA 867 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I 

Db 104 GTTTGTTAACTCTGATTTTGCCTTTGATTTTGCCCGGCCCCTGCTTCCCAACACTGTTTA 45 

Qy 868 TGTTGGAGGCTTGATGGAAAAACCTATTAAACCAGTACCACAA 910 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I M I I I I I II I 

Db 44 TATTGGAGGCTTGATGG7UUWVCCTATTAAACCAGTACCACAA 2 



RESULT 14 

CB435346/C 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 
JOURNAL 
COMMENT 



CB435346 659 bp itiRNA linear 

615435 M7VRC 6B0V Bos taurus cDNA 3*, mRNA sequence 
CB435346 

CB435346.1 GI: 29216283 
EST. 

Bos taurus (cow) 
Bos taurus 
Eukaryota; Metazoa; 
Mammalia; Eutheria; 
Bovinae; Bos - 
1 (bases 1 to 659) 
Smith^T. P. L. , Roberts^ A. J. , 
Wray^J.E. and Keele^J.W. 
A second set of bovine ESTs 
Unpublished (2003) 
Contact: Smith TPL 

USDA, ARS, US Meat Animal Research Center 
PO Box 166, Clay Center, NE 68933-0166, USA 
Tel: 402 762 4366 
Fax: 402 762 4390 
Email : smithfiemail . marc . usda . gov 



EST 25-MAR-2003 



Chordata; Craniata; Vertebrata; Euteleostomi ; 
Cetartiodactyla; Ruminantia; Pecora; Bovidae; 



Echternkamp, S . E. , Chitko-McKown, C. G. , 
from pooled-tissue normalized libraries 



Single pass sequencing. Bases called with phred vO. 020425. c and 
triiraned. with the aid of - the trim_alt option. Vector identified with 
* cross_match vO. 990329. 
Plate: FQY8035 row: J column: 24 
Seq primer: TAGAAGGCACAGTCGAGG . 
FEATURES Location/Qualifiers 
source 1. ,659 

/organism="Bos taurus" 
/mol_type="mRNA" 
/db_xref="taxon: 9913" 
/tissue_type= "pooled" 
/lab_host="DH10B" 
/clone_lib="MARC 6B0V" 

/note="Vector: pcDNA3.1; Site_l: EcoRI; Site_2: NotI; 
Library made with RNA pooled from multiple tissues 
including liver, lung, hypothalamus, pituitary, and 
placenta/endometrium. " 

ORIGIN 

Query Match 19.7%; Score 457.6; DB 6; Length 659; 

Best Local Similarity 81.1%; Pred. No. 3.8e-118; 

Matches 532; Conservative 0; Mismatches 124; Indels 0; Gaps 0; 

Qy 446 AGAAAGGATATCATGGATTCCTTAAAGAATGAGAACTTCGACATGGTGATAGTTGAAACT 505 

I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II I I 
Db 658 AGAAAGGATGTCATGAAGTCTTTAAAGAATGAGAACTTCGACCTGGTGATAGTGGAAATG 599 

Qy 506 TTTGACTACTGTCCTTTCCTGATTGCTGAGAAGCTTGGGAAGCCATTTGTGGCCATTCTT 565 

1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 n I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 598 TTTGACTACTGTCCTTTCTTGGTTGCTGAGAAGCTTGGGAAGCCATTTGTGGCCATTCTC 539 

Qy 566 TCCACTTCATTCGGCTCTTTGGAATTTGGGCTACCAATCCCCTTGTCTTATGTTCCAGTA 625 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I U I I I I I I I I I I I I 

Db 538 CCTTCCGCACTGGGCACCGTGGACTTTGGGCTACCAAGCCCTCTGTCTTATGTGCCAGTG 479 

Qy 626 TTCCGTTCCTTGCTGACTGATCACATGGACTTCTGGGGCCGAGTGAAGAATTTTCTGATG 685 

III I I I I I I I II II II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
Db 478 TTCTACTCCTTGTTAACGGACCAGATGGACTTCTGGGGCCGAGTAAAGAACTTCCTGATG 419 

Qy 686 TTCTTTAGTTTCTGCAGGAGGCAACAGCACATGCAGTCTACATTTGACAACACCATCAAG 745 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 418 TTCTTTGAGTTCTTCAAGAAGCAGTGGAAAATCCAGTCTGCATATGATGACACCATCAAG 359 

Qy 746 GAACATTTCACAGAAGGCTCTAGGCCAGTTTTGTCTCATCTTCTACTGAAAGCAGAGTTG 805 

II I I I I I I I II I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db ^.358 GAGCATTTCCCGGACGACTCTAGGCCAGTTTTGTCTCATCTTCTAACAAAAGCAGAGCTG 299 

Qy 806 TGGTTCATTAACTCTGACTTTGCCTTTGATTTTGCTCGACCTCTGCTTCCCAACACTGTT 865 

I I I I I I I I I I I I I I.I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 298 TGGTTTGTTAACACCGACTTTGCCTTTGATTTTGCTCAGCCCCTGCTTCCCAACACTGTG 239 

Qy 866 TATGTTGGAGGCTTGATGGAAAAACCTATTAAACCAGTACCACAAGACTTGGAGAACTTC 925 

I I I I I I I I I II III I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I III 

Db 238 TGTATTGGAGGTTTAATGTCCAAACCTGTTAAACCAGTACCACAAGAATTTGAGAATTTC 179 

Qy 926 ATTGCCAAGTTTGGGGACTCTGGTTTTGTCCTTGTGACCTTGGGCTCCATGGTGAACACC 985 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Db 



178 ATCACCAAATTTGGAGACTCTGGTTTTGTCCTTGTAAGCCTGGGGTCCATGGTGAGTTTC 119 



Qy 

Db 

Qy 

Db 



986 TGTCAGAATCCGGAAATCTTCAAGGAGATGAACAATGCCTTTGCTCACCTACCCCAAGGG 1045 

III I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I M I I I 
118 ATTCGGTCCCAGGAGGTTCTCAAAGAGATGAATGCTGCCTTTGCTCATCTCCCTCAAGGG 59 

1046 GTGATATGGAAGTGTCAGTGTTCTCATTGGCCCAAAGATGTCCACCTGGCTGCAAA 1101 
I I I I I I I I I I I I I I I M I I I I I I II I I I M I I I II I I I I I Mil 
58 GTGATATGGAAGTATAATCCTTCTCATTGGCCCAAAGACATCAAATTGGCCCCAAA 3 



RESULT 15 

AK052644 

LOCUS 

DEFINITION 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

J0URN7VL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 



TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 

TITLE 
JOURNAL 
REFERENCE 



AK052644 1723 bp mRNA linear HTC 03-APR-2004 

Mus mus cuius 0 day neonate kidney cDNA, RIKEN full-length enriched 
library, clone : D630011D02 product : hypothetical UDP-glucoronosyl and 
UDP-glucosyl transferase containing protein, full insert sequence. 
AK052644 

AK052644.1 GI:26095303 
HTC; CAP trapper. 
Mus mus cuius (house mouse) 
Mus mus cuius 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 
1 

Carninci,P. and Hayashizaki, Y. 

High-efficiency full-length cDNA cloning 

Meth. Enzymol. 303, 19-44 (1999) 

99279253 

10349636 

2 

Carninci^P., Shibata^Y.^ Hayatsu^N., Sugahara,Y., Shibata,K.^ 

Itoh,M., Konno,H., Okazaki^Y. , Muramatsu,M. and Hayashizaki, Y. 

Normalization and subtraction of cap-trapper-selected cDNAs to 

prepare full-length cDNA libraries for rapid discovery of new genes 

Genome Res. 10 (10)^ 1617-1630 (2000) 

20499374 

11042159 

3 

Shibata, K 



Itoh,M., Aizawa,K. 



r Sasaki^N., 
. , Tashiro^H, 
Nishine^T. , 



Carninci, P . , 
, Itoh^M., 
Harada, A. , 



Nagaoka^ S . 

Konno^H., Akiyama,J., Nishi^K., Kitsunai,T 
Sumi,N.^ Ishii,Y., Nakamura^S., Hazama,M., 
Yamamoto^R., Matsumoto, H. , Sakaguchi, S . , Ikegami,T., Kashiwagi, K. , 
Fujiwake^S., Inoue^K., Togawa,Y., Izawa^M., Ohara,E., Watahiki^M., 
Yoneda^Y., Ishikawa,T., Ozawa,K., Tanaka^T., Matsuura^S., Kawai,J. 
Okazaki^Y., Muramatsu^M. , Inoue,Y., Kira,A. and Hayashizaki, Y. 
RIKEN integrated sequence analysis (RISA) system — 384-format 
sequencing pipeline with 384 multicapillary sequencer 
Genome Res. 10 (11), 1757-1771 (2000) 
20530913 
11076861 
4 

The RIKEN Genome Exploration Research Group Phase II Team and the 
FANTOM Consortium. 

Functional annotation of a full-length mouse cDNA collection 

Nature 409, 685-690 (2001) 

5 



AUTHORS 

TITLE 

JOURNAL 
REFERENCE 
AUTHORS 



TITLE 
JOURNAL 



COMMENT 



FEATURES 

source 



misc feature 



The FANTOM Consortium and the RIKEN Genome Exploration Research 
Group Phase I & II Team. 

Analysis of the mouse transcriptome based on functional annotation 
of 60,770 full-length cDNAs 
Nature 420, 563-573 (2002) 
6 (bases 1 to 1723) 

Adachi,J., Aizawa,K., Akimura,T., Arakawa,T., Bono,H., Carninci,P. 
Fukuda,S., Furuno,M., Hanagaki^T., Hara,A., Hashizume, W . , 
Hayashida, K. , Hayatsu,N., Hiramoto,K., Hiraoka,T., Hirozane,T., 
Hori,F., Imotani^K., Ishii,Y., Itoh,M., Kagawa,I., Kasukawa^T., . 
Katoh,H., Kawai,J., Kojima,Y., Kondo,S., Konno,H., Kouda^M., 
Koya,S., Kurihara^C, Matsuyama, T . , Miyazaki,A., Murata^M., 
Nakamura,M., Nishi^K., Nomura, K., Numazaki,R., Ohno,M., Ohsato,N., 
Okazaki,Y., Saito,R., Saitoh, H., Sakai,C., Sakai,K., Saka2ume,N., 
Sano,H., Sasaki, D., Shibata,K., Shinagawa, A. , Shiraki,T., 
Sogabe,Y., Tagami,M., Tagawa,A. , Takahashi, F. , Takaku-Akahira, S . , 
Takeda,Y., Tanaka,T., Tomaru,A., Toya,T., Yasunishi, A. , 
Muramatsu,M. and Hayashizaki, Y. 
Direct Submission 

Submitted (16-JUL-2001) Yoshihide Hayashizaki, The Institute of 
Physical and Chemical Research (RIKEN) , Laboratory for Genome 
Exploration Research Group, RIKEN Genomic Sciences Center (GSC) , 
RIKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama 
Kanagawa 230-0045, Japan (E-mail : genome-res@gsc. riken. jp^ 
URL: http : //genome . gsc. riken. jp/, Tel : 81-45-503-9222, 
Fax:81-45-503-9216) 

cDNA library was prepared and sequenced in Mouse Genome 
Encyclopedia Project of Genome Exploration Research Group in Riken 
Genomic Sciences Center and Genome Science Laboratory in RIKEN. 
Division of Experimental Animal Research in Riken contributed to 
prepare mouse tissues. 

Please visit our web site for further details. 
URL: http: / /genome. gsc. riken. jp/ 
URL: http: // fantom. gsc. riken. jp/ . 

Location/Qualifiers 

1. .1723 

/organism="Mus musculus" 

/mol_type= "mRNA" 

/strain="C57BL/6J" 

/db_xref="FANTOM_DB:D630011D02" 

/db_xref="taxon: 10090" 

/clone="D630011D02" 

/tissue_type="kidney" 

/clone_lib="RIKEN full-length enriched mouse cDNA library 
/dev_stage="0 day neonate" 
1. .1723 

/note="hypothetical UDP-glucoronosyl and UDP-glucosyl 
transferase containing protein (InterPro | IPR002213, 
evidence: InterPro) " 



ORIGIN 



Query Match 19.5%; 
Best Local Similarity 69.1%; 
Matches 649; Conservative 



Score 452.6; DB 3; 
Pred. No. 1.5e-116; 
0; Mismatches 284; 



Length 1723; 



Indels 



6; Gaps 



Qy 



1 AGGGTCCCTTAGCCGGGCGCAGGGCGCGCAGCCCAGGCTGAGATCCGCGGCTTCCGTAGA 60 
I I I I I I I I I I I I M I I I I I I I I I I I I I I 



41 AGGGTCCCTTAGTGGGGCACAGCTCTTTAGGTGTGCAACCTGTGTCAGGGCTCCACATAT 100 



61 AGTGAGCATGGCTGGGCAGCGAGTGCTTCTTCTAGTGGGCTTCCTTCTCCCTGGGGTCCT 120 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

101 TCTACAGATGGCAGCACATCGGCGTTGGCTTCTCATGAGCTTCCTTTTCCTTGAGGTTAT 160 

121 GCTCTCAGAGGCTGCCAAAATCCTGACAATATCTACAGTAGGTGGAAGCCATTATCTACT 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I II I I I I II I 

161 TCTCTTGGAGGCTGCAAAAATCCTGACTATATCTACACTGAGTGCAAGCCATTATATAGT 220 

181 GATGGACCGGGTTTCTCAGATTCTTCAAGATCACGGTCATAATGTCACCATGCTTAACCA 240 

III I I I I I I I I I I I I I I I I I I I I I I M M I I I II III I 

221 GATAAGCCGTGTGTCGCAAGTTCTTCATGAAGGTGGCCACAATGTGACCAAACTTCTTTA 280 

241 CAAAAGAGGTCCTTTTATGCCAGATTTTAAAAAGGAAGAAAAATCATATCAAGTTATCAG 300 

III II II III I I I I I I I I I I I I I I I I I II 11 I I I I I I I II I I II I 

281 TGAAA GTGCTAATATCCCAGATTTTAGAAAGGAAAAACCATCATATCAAGTTATTAA 337 

301 TTGGCTTGCACCTGAAGATCATCAAAGAGAATTTAAAAAGAGTTTTGATTTCTTTCTGGA 360 

II I I I I I I I I I I I I II I I I III II I I I I I II II II 

338 TTGGCGTCCACCTGAAGATCAGGAAAAGAAATTTGCTGATCTTAGGCATCGACTTACAGA 397 

361 AGAAACTTTAGGTGGCAGAGGAAAATTTGAAAACTTATTAAATGTTCTAGAATACTTGGC 420 

I I I I I . I I I I I I Mi I I I I I I II I II I I I I I I I I 

398 AGAAATAACTTATGGCAGGTCCAAACATCACACCCTTCTAAAGATCCATCAATACTTTGG 457 

421 GTTGCAGTGCAGTCATTTTTTAAATAGAAAGGATATCATGGATTCCTTAAAGAATGAGAA 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 

458 GGACTTATGCAGTCAATTATTAAGCAGAAAGGACATCATGGACTTCTTAAAAAATGAGAA 517 

481 CTTCGACATGGTGATAGTTGAAACTTTTGACTACTGTCCTTTCCTGATTGCTGAGAAGCT 540 

I I I I I I I I I I I I I I I III I I II I I I I I I I I I I I I I I I I I I 

518 CTTTGACCTAGTACTTCTTGATTCAATGGATCTCTGTTCTTTGCTAATTGTTGAAAAGCT 577 

541 TGGGAAGCCATTTGTGGCCATTCTTTCCACTTCATTCGGCTCTTTGGAATTTGGGCTACC 600 

I I I I I I I I I I I I II II I I I I I I I I III I I I I I I I I I I I I I I I I I I 

578 TGGGAAACGATTTGTGTCCTTTCTTCCCTTTCAATTTAGCTATATGGACTTTGGGTTACC 637 

601 AA TCCCCTTGTCTTATGTTCCAGTATTCCGTTCCTTGCTGACTGATCACATGGACTT 657 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

638 AAGCGCCCCCTTGTCCTATGCTCCAGTGTATGGTTCTGGTCTAACTGACCAAATGGACTT 697 

658 CTGGGGCCGAGTGAAGAATTTTCTGATGTTCTTTAGTTTCTGCAGGAGGCAACAGCACAT 717 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II II I I I I I I II 

698 CTGGGGCCGAGTGAAGAACTTTCTGATGTTCCTTGATTTCTCCATGAAGCAAAGGGAAAT 757 

718 GCAGTCTACATTTGACAACACCATCAAGGAACATTTCACAGAAGGCTCTAGGCCAGTTTT 777 

I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

758 CCTTTCTCAGTATGACAGCACTATCCAGGAGCATTTTGTGGAAGGCTCTCAGCCAGTGTT 817 

778 GTCTCATCTTCTACTGAAAGCAGAGTTGTGfeTTCATTAACTCTGACTTTGCCTTTGATTT 837 

I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I II I I I I I 

818 GTCTGACCTTCTACTGAAAGCTGAGCTGTGGTTTGTCAACTCTGACTTTGCTTTGGATTT 877 

838 TGCTCGACCTCTGCTTCCCAACACTGTTTATGTTGGAGGCTTGATGGAAAAACCTATTAA 897 

I I I I I I II III I I i I I I I I I I II I I I I I I I I I I II I I I I I I I I I I II I 

878 TGCTCGTCCCCTGTTTCCCAACACAGTCTATGTGGGAGGTTTACTGGACJ\AACCTGTTCA 937 



Qy 898 ACCAGTACCACAAGACTTGGAGAACTTCATTGCCAAGTT 936 

I I I I I 11 I I I I I I I I I I I II II 

Db 938 GCCAATACCCCAAGTAAGTGACAAATTAGCTCTCACTTT 976 



Search completed: February 15, 2005, 09:44:23 
Job time : 6858 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence: 



February 15, 2005, 06:26:17 ; Search time 9715 Seconds 

(without alignments) 
11571.395 Million cell updates/sec 

US-10-017-867A-281 
2320 

1 agggtcccttagccgggcgc tctctccccaacctcactaa 2320 



Scoring table: IDENTITY_NUC 

Gapop 10.0 / Gapext 1.0 

Searched: 4708233 seqs, 24227607955 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



9416466 



Post-processing : 



Database : 



Minimum Match 0% 

Maximum Match 100% 

Listing first 1500 summaries 



GenEmbl : 



1 




gb_ba : * 


2 




gb_htg : * 


3 




gb in:* 


4 




gb_om: * 


5 




gb_ov: * 


6 




gb pat:* 


7 




gb_ph : * 


8 




gb_pl : * 


9: 




gb_pr: * 


10: 


gb_ro : * 


11: 


gb_sts : * 


12: 


gb_sy : * 


13: 


gb_un : * 


14: 


gb_vi : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



% 

Result Query 

No. Score Match Length DB 



ID 



Description 



1 2320 100.0 

2 2320 100.0 

3 2310.2 99.6 



2320 6 AX697213 
2320 9 AY358416 
2341 6 AX136141 



AX697213 Sequence 
AY358416 Homo sapi 
AX136141 Sequence 





4 


2310.2 


99. 


6 


2341 


6 


BD123523 


BD123523 Secretory 




5 


2310.2 


99. 


6 


2341 


9 


AK075383 


AK075383 Homo sapi 




6 


2102.8 


90. 


6 


2944 


6 


AX548037 


AX548037 Sequence 




7 


1578.8 


68. 


1 


2263 


6 


AX958403 


AX958403 Sequence 




8 


1281.6 


55. 


2 


2082 


6 


AX359921 


AX359921 Sequence 




9 


1281.6 


55. 


2 


2797 


6 


AX155211 


AX155211 Sequence 




10 


1281.6 


55; 


2 


2823 


6 


AX714604 


AX714604 Sequence 




11 


1281.6 


55. 


2 


2823 


9 


AK057066 


AK057066 Homo sapi 




12 


1280.6 


55. 


2 


2759 


9 


BC068446 


BC068446 Homo sapi 




13 


1280 


55. 


2 


2086 


6 


AX327327 


AX327327 Sequence 




14 


1212 


52. 


2 


1572 


6 


AX359923 


AX359923 Sequence 




15 


1147.8 


49. 


5 


2426 


6 


AR541778 


AR541778 Sequence 


c 


16 


957.6 


41. 


3 


88948 


2 


AC008947 


AC008947 Homo sapi 




17 


957.6 


41. 


3 


179513 


9 


AC016612 


AC016612 Homo sapi 




18 


944.8 


40. 


7 


1569 


6 


AX155213 


AX155213 Sequence 




19 


912.4 


39. 


3 


2210 


10 


AK128903 


AK128903 Mus muscu 




20 


912.2 


39. 


3 


2192 


10 


BC034837 


BC034837 Mus muscu 




21 


912.2 


39. 


3 


2196 


10 


BC022134 


BC022134 Mus muscu 




22 


910.8 


39. 


3 


2212 


6 


AX155214 


AX155214 Sequence 




23 


909.2 


39. 


2 


2228 


10 


BC025940 


BC025940 Mus muscu 




24 


905.8 


39. 


0 


1836 


6 


CQ726650 


CQ726650 Sequence 




25 


789 


34. 


0 


1569 


6 


AX155216 


AX155216 Sequence 




26 


595.4 


25. 


7 


148069 


2 


AC025476 


AC025476 Homo sapi 




27 


595.2 


25. 


7 


770 


6 


AX136483 


7^136483 Sequence 




28 


595.2 


25. 


7 


770 


6 


BD123723 


BD123723 Secretory 




29 


593.4 


25. 


6 


1842 


6 


AX747286 


AX747286 Sequence 




30 


593.4 


25. 


6 


1842 


9 


AK091977 


AK091977 Homo sapi 




31 


592.2 


25. 


5 


102255 


2 


AC026735 


AC026735 Homo sapi 




32 


575.4 


24. 


8 


5002 


6 


CQ842882 


CQ842882 Sequence 




33 


575.4 


24. 


8 


5002 


9 


AK125803 


AK125803 Homo sapi 




34 


573.6 


24. 


7 


125145 


2 


AC022136 


AC022136 Homo sapi 




35 


573.4 


24. 


7 


3108 


9 


BC035012 


BC035012 Homo sapi 


c 


36 


538.8 


23. 


2 


148069 


2 


AC025476 


AC025476 Homo sapi 


c 


37 


538.4 


23. 


2 


594 


6 


AX136632 


AX136632 Sequence 


c 


38 


538.4 


23. 


2 


594 


6 


BD123872 


BD123872 Secretory 




39 


471.2 


20. 


3 


1885 


5 


BC076709 


BC076709 Xenopus t 




40 


468.6 


20. 


2 


1835 


5 


CR760227 


CR7 60227 Xenopus t 




41 


444.8 


19. 


2 


88948 


2 


AC008947 


AC008947 Homo sapi 




42 


434.2 


18. 


7 


1839 


5 


BC082844 


BC082844 Xenopus 1 




43 


-433.6 


18. 


7 


105885 


9 


AC008860 


AC008860 Homo sapi 


c 


44 


320.6 


13. 


8 


156499 


2 


AC132893 


AC132893 Mus muscu 




45 


308.2 


13. 


3 


251769 


2 


AC117913 


AC117913 Rattus no 




46 


308 


13. 


3 


156499 


2 


AC132893 


AC132893 Mus muscu 


c 


47 


245.2 


10. 


6 


211178 


10 


AC139209 


AC139209 Mus muscu 


c 


48 


239.2 


10. 


3 


125145 


2 


AC022136 


AC022136 Homo sapi 


c 


49 


239.2 


lA. 


.3 


165304 


9 


AC112204 


AC112204 Homo sapi 


c 


50'^ 


238.8 


10. 


3 


507 


6 


CQ337751 


" CQ337751 Sequence 


c 


51 


220 


9. 


5 


277 


6 


CQ350335 


CQ350335 Sequence 




52 


174 


7. 


5 


1879 


5 


AF129809 


AF129809 Gallus ga 




53 


167.2 


7. 


2 


3598 


10 


AK128994 


AK128994 Mus muscu 




54 


167.2 


7. 


2 


4185 


10 


RNU07683 


U07683 Rattus norv 




55 


165.6 


7. 


1 


2633 


10 


RATCERUDPG 


L21698 Rat ceramid 




56 


164 


7. 


1 


2308 


6 


AX305987 


AX305987 Sequence 




57 


164 


7. 


1 


2308 


10 


MMGCGTEX 


X92122 M.musculus 




58 


164 


7. 


1 


3439 


10 


BC016885 


BC016885 Mus muscu 




59 


161.2 


6. 


9 


1593 


4 


AY135176 


AY135176 Canis fam 




60 


159.8 


6. 


9 


1609 


4 


AF129810 


AF129810 Bos tauru 





61 


158 


6. 


8 


291 


6 


CQ740738 


CQ740738 Sequence 




62 


156. 8 


6. 


8 


464 


11 


G60984 


G60984 SHGC-84035 


c 


63 


156 . 8 


6. 


8 


69315 


5 


AL935277 


AL935277 Zebrafish 




64 


156. 4 


6. 


7 


1671 


4 


RABUGT2BB 


L01082 Oryctolagus 




65 


156. 2 


6. 


7 


239559 


2 


AC095531 


AC095531 Rattus no 




66 


152 . 6 


6. 


6 


1662 


9 


AF294902 


AF294902 Macaca mu 




67 


ICO ^ 

152 . 5 


6. 


6 


1753 


9 


AF016310 


AF016310 Macaca fa 




68 


152 


6. 


6 


1768 


9 


AF112113 


AF112113 Macaca fa 




69 


151 . 8 


6. 


5 


1832 


4 


RABUGT2BA 


L01081 Oryctolagus 




70 


151 


6. 


5 


2560 


10 


BC027200 


BC027200 Mus muscu 




71 


148. 4 


6. 


4 


2897 


4 


RABUGT2BC 


L01083 Oryctolagus 




72 


148.2 


6. 


4 


2108 


9 


AF112112 


AF112112 Macaca fa 




73 


148 


6. 


4 


1716 


10 


RNUDPGT 


Y00156 Rat mRNA fo 




74 


148 


6. 


4 


1819 


10 


RATGRT 


M31109 Rat UDP-glu 




75 


148 


6. 


4 


2547 


10 


AF175221 


AF175221 Cavia por 


c 


76 


147.2 


6. 


3 


169025 


2 


CR545478 


CR545478 Danio rer 


c 


11 


147.2 


6. 


3 


179798 


5 


BX284110 


BX284110 Zebrafish 




78 


146. 6 


6. 


3 


1825 


10 


AB034987 


AB034987 Cavia por 




79 


145.2 


6. 


3 


1961 


6 


AX401727 


AX401727 Sequence 




80 


145.2 


6. 


3 


1961 


10 


RATUDPGTP 


M13506 Rat liver U 




81 


144 . 8 


6. 


2 


2520 


10 


AB034988 


AB034988 Cavia por 




82 


144 . 6 


6. 


2 


1662 


9 


AF294901 


AF294901 Macaca mu 




83 


144.2 


6. 


2 


1584 


6 


AX497165 


AX497165 Sequence 




84 


144.2 


6. 


2 


2786 


9 


AK025587 


AK025587 Homo sapi 




85 


144 .2 


6. 


2 


3006 


6 


AX497163 


AX497163 Sequence 




86 


143.2 


6. 


2 


1636 


6 


AX327326 


AX327326 Sequence 




87 


143.2 


6. 


2 


1647 


9 


AY542891 


AY542891 Homo sapi 




88 


143.2 


6. 


2 


2759 


6 


AX395164 


AX395164 Sequence 




89 


143.2 


6. 


2 


2966 


6 


AR447660 


AR447660 Sequence 




90 


143.2 


6. 


2 


2974 


6 


AX376454 


AX376454 Sequence 




91 


143.2 


6. 


2 


2974 


9 


AY358727 


AY358727 Homo sapi. 




92 


143 


6. 


2 


2648 


9 


MFU91582 


U91582 Macaca fasc 




93 


142 


6. 


1 


1858 


10 


RATUDPA 


M3374 6 Rat UDP glu 




94 


141. 8 


6. 


1 


336 


6 


AX307956 


AX307956 Sequence 




95 


141. 8 


6. 


1 


1915 


9 


AF401657 


AF401657 Macaca fa 




96 


141.8 


6. 


1 


1947 


10 


RNU27518 


U27518 Rattus norv 




97 


141 


6. 


1 


1648 


10 


BC028262 


BC028262 Mus muscu 




98 


140.2 


6. 


0 


2595 


10 


BC058786 


BC058786 Mus muscu 




99 


140 


6. 


0 


2634 


6 


AX827435 


AX827435 Sequence 




100 


140 


6. 


0 


2634 


10 


RATUDPGTR 


J02589 Rattus norv 




101 


139. 8 


6. 


0 


1584 


10 


AF184901 


AF184901 Mus muscu 




102 


139. 8 


6. 


0 


1639 


6 


AX548042 


AX548042 Sequence 




103 


139. 8 


6. 


0 


2585 


10 


BC048926 


BC048926 Mus muscu 




104 


139 . 8 


6. 


0 


125523 


5 


BX248385 


BX248385 Zebrafish 




105 


139 . 6 


6. 


0 


1896 


10 


BC078782 


BC078782 Rattus no 




106 


139. 4 


6. 


0 


1858 


10 


MMUDP.GT 


X06358 Mouse mRNA 




107 


139. 4 


6. 


0 


1895 


10 


BC019487 


BC019487 Mus muscu 




108 


139.2 


6. 


0 


1855 


6 


CQ812293 


CQ812293 Sequence 




109 


139.2 


6. 


0 


1855 


6 


CQ870648 


CQ870648 Sequence 




110 


139.2 


6. 


0 


1855 


6 


AX336329 


AX336329 Sequence 




111 


139.2 


6. 


0 


1855 


6 


AX336696 


AX336696 Sequence 




112 


139.2 


6. 


0 


1855 


6 


AX409473 


AX409473 Sequence 




113 


139.2 


6. 


0 


1855 


9 


HUMUDPGTA 


J05428 Human 3,4-c 




114 


137.8 


5. 


9 


1831. 


10 


RRUGTGENE 


X57565 Rattus norv 




115 


137.8 


5. 


9 


1891 


10 


BC057169 • 


BC057169 Mus muscu 




116 


137.8 


5. 


9 


1895 


10 


BC069923 


BC069923 Mus muscu 




117 


137.8 


5. 


9 


1903 


10 


BC015272 


BC015272 Mus muscu 



118 


137 


5. 


9 


2125 


5 


BC078212 


BC078212 Danio rer 


119 


136. 8 


5. 


9 


1846 


10 


RNUDPGTR 


X03478 Rat liver m 


120 


136. 4 


5. 


9 


1692 


9 


BC075069 


BC075069 Homo sapi 


121 


136. 4 


5. 


9 


1844 


10 


RNU06273 


U06273 Rattus norv 


122 


136. 4 


5. 


9 


2107 


6 


AR168316 


AR168316 Sequence 


123 


136.4 


5. 


9 


2107 


9 


HSU59209 


U59209 Homo sapien 


124 


136.4 


5. 


9 


2172 


6 


CQ850790 


CQ850790 Sequence 


125 


136.4 


5. 


9 


2172 


9 


AK127970 


AK127970 Homo sapi 


126 


136.4 


5. 


9 


2448 


6 


AX335823 


AX335823 Sequence 


127 


136.4 


5. 


9 


2448 


6 


AX770547 


AX770547 Sequence 


128 


136.4 


5. 


9 


2448 


9 


HSU30930 


U30930 Human UDP-G 


129 


136 


5. 


9 


1766 


9 


BC030974 


BC030974 Homo sapi 


130 


136 


5. 


9 


1854 


6 


BD229166 


BD229166 Genotype 


131 


136 


5. 


9 


1854 


6 


AR349418 


AR349418 Sequence 


132 


133. 8 


5. 


8 


1650 


6 


BD260955 


BD260955 Carbohydr 


133 


133.8 


5. 


8 


1650 


6 


AX040087 


AX040087 Sequence 


134 


133. 8 


5. 


8 


1894 


9 


HSU62899 


U62899 Human ceram 


135 


133.6 


5. 


8 


1976 


5 


BX936208 


BX936208 Gallus ga 


136 


133.4 


5. 


8 


1760 


6 


CQ727328 


CQ727328 Sequence 


137 


133.4 


5. 


8 


1766 


9 


HSA6054 


AJ006054 Homo sapi 


138 


132 


5. 


7 


1614 


6 


AX958396 


AX958396 Sequence 


139 


131.8 


5. 


7 


1851 


9 


AF177272 


AF177272 Homo sapi 


140 


131.8 


5. 


7 


2093 


5 


BC076044 


BC076044 Danio rer 


141 


131.6 


5. 


7 


1846 


10 


RNU06274 


U06274 Rattus norv 


142 


131.6 


5. 


7 


2079 


9 


HSAJ5162 


AJ005162 Homo sapi 


143 


131.6 


5. 


7 


2092 


6 


BD229135 


BD229135 Genotype 


144 


131.6 


5. 


7 


2092 


6 


AR349387 


AR349387 Sequence 


145 


131.6 


5. 


7 


2129 


9 


BC026264 


BC026264 Homo sapi 


146 


130.6 


5. 


6 


1649 


9 


BC069441 


BC069441 Homo sapi 


147 


130.6 


5. 


6 


2818 


4 


OCU72742 


U72742 Oryctolagus 


148 


130 


5. 


6 


2097 


9 


AF064200 


AF064200 Homo sapi 


149 


130 


5. 


6 


2123 


6 


AX410640 


AX410640 Sequence 


150 


130 


5. 


6 


2123 


9 


HSU06641 


U06641 Human UDP g 


151 


129 


5. 


6 


1722 


9 


AF016492 


AF016492 Homo sapi 


152 


129 


5. 


6 


2799 


6 


AX411112 


AX411112 Sequence 


153 


129 


5. 


6 


2799 


9 


HSUGT2BI0 


X63359 H. sapiens U 


154 


128.6 


5. 


5 


1500 


9 


AF177274 


AF177274 Homo sapi 


155 


128.4 


5. 


5 


1976 


6 


BD229238 


BD229238 Genotype 


156 


128.4 


5. 


5 


1976 


6 


AR349490 


AR349490 Sequence 


157 


128.4 


5. 


5 


2075 


9 


AF072223 


AF072223 Macaca fa 


158 


128.4 


5. 


5 


2090 


6 


AX410646 


AX410646 Sequence 


159 


128.4 


5. 


5 


2090 


9 


HSU08854 


U08854 Human UDP g 


160 


128.4 


5. 


5 


2091 


9 


AF081793 


AF081793 Homo sapi 


161 


128.4 


5. 


5 


2092 


9 


AF548389 


AF548389 Homo sapi 


162 


128.4 


5. 


5 


2114 


9 


AF180322 


AF180322 Homo sapi 


163 


12,8..^ 


5. 


5 


1822 


5 


BC075289 


BC025289 Xenopus t 


164 


128.2 


5. 


5 


3062 


10 


BC028826 


BC028826 Mus muscu 


165 


127.6 


5. 


5 


1413 


6 


AR208679 


AR208679 Sequence 


166 


127.6 


5. 


5 


1413 


6 


AR490895 


AR490895 Sequence 


167 


125.2 


5. 


4 


2093 


6 


AX411198 


AX411198 Sequence 


168 


125.2 


5. 


4 


2093 


9 


HSUDPGT 


Y00317 Human mRNA 


169 


124.8 


5. 


4 


2816 


4 


AF039138 


AF039138 Felis cat 


170 


124.8 


5. 


4 


2821 


4 


AF039137 


AF039137 Felis cat 


171 


123.8 


5. 


3 


1895 


5 


BC075892 


BC075892 Danio rer 


172 


122.4 


5. 


3 


2103 


10 


BC025795 


BC025795 Mus muscu 


173 


121.6 


5. 


2 


1602 


9 


AF360121 


AF360121 Macaca mu 


174 


121.6 


5. 


2 


2349 


4 


AB010872 


AB010872 Felis cat 



175 


120.8 


5 . 


2 


1590 


4 


AF011902 


AF011902 Oryctolag 


176 


Izi U . 0 


c 

O . 


2 


1 O O /I 


o 


AX958404 


AX958404 Sequence 


177 


120 


5 . 


2 


1647 


9 


AF104339 


AF104339 Macaca fa 


178 


119 . 6 


5 . 


2 


1587 


4 


CFA290948 


AJ290948 Cams ram 


179 


119 . 6 


5 . 


2 


2342 


9 


AY435136 


AY435136 Homo sapi 


180 


119 . 6 


5 . 


2 


2351 


6 


CQ720471 


CQ720471 Sequence 


181 


119 . 6 


5. 


2 


2351 


6 


AX706961 


AX706961 Sequence 


182 


119 . 6 


5 . 


2 


2351 


6 


AX707891 


AX707891 Sequence 


183 


119 . 6 


5 . 


2 


2351 


9 


HUMHUGBRl 


M57899 Human bilir 


184 


119.2 


5. 


1 


1621 


9 


AF104338 


AF104338 Macaca fa 


185 


119 


5. 


1 


1961 


5 


BC055635 


BC055635 Danio rer 


186 


118 . 8 


5 . 


1 


2396 


9 


AF104336 


AF104336 Macaca fa 


187 


117 . 6 


5 . 


1 


2339 


9 


AY435141 


AY435141 Homo sapi 


188 


117. 6 


5. 


1 


2440 


9 


AF104337 


AF104337 Macaca fa 


189 


116. 8 


. 5. 


0 


1602 


9 


AY029169 


AY029169 Macaca mu 


190 


116. 2 


5. 


0 


2196 


10 


AY227201 


AY227201 Mus muscu 


191 


116 


5. 


0 


2388 


9 


BC019861 


BC019861 Homo sapi 


192 


115. 8 


5. 


0 


2422 


6 


AX336711 


AX336711 Sequence 


193 


115. 8 


5. 


0 


2422 


6 


AX409459 


AX409459 Sequence 


194 


115. 8 


5. 


0 


2422 


9 


HUMUGTIFA 


J04093 Homo sapien 


195 


114 . 6 


4 . 


9 


2216 


10 


AY435129 


AY435129 Rattus no 


196 


114 . 4 


4 . 


9 


2345 


9 


AY435138 


AY435138 Homo sapi 


197 


114.4 


4. 


9 


2345 


9 


AY435140 , 


AY435140 Homo sapi 


198 


114.2 


4. 


9 


2333 


9 


AY435142 


AY435142 Homo sapi 


199 


113.2 


4 . 


9 


2392 


9 


S55985 


S55985 Homo sapien 


200 


112. 8 


4 . 


9 


2345 


9 


AY435139 


AY435139 Homo sapi 


201 


112. 8 


4. 


9 


2368 


9 


HUMHUGBR2 


M57951 Human bilir 


202 . 


112. 6 


4 . 


9 


1593 


9 


HSU89508 


U89508 Human UDP-g 


203 


112 . 6 


4 . 


9 


2003 


9 


AK025403 


AK025403 Homo sapi 


204 


112 . 6 


4 . 


9 


2333 


9 


AY435137 


AY435137 Homo sapi 


205 


112 . 6 


4 . 


9 


2390 


9 


BC069210 


BC069210 Homo sapi 


206 


112 . 6 


4 . 


9 


2423 


9 


BC020971 


BC020971 Homo sapi 


207 


111 . 8 


4 . 


8 


1590 


4 


AB008677 


AB008677 Bos tauru 


208 


111. 8 


4 . 


8 


2219 


10 


AY435131 


AY435131 Rattus no 


209 


111.4 


4 . 


8 


2320 


6 


AX286762 


AX286762 Sequence 


210 


111. 4 


4 . 


8 


2320 


6 


AX286764 


AX286764 Sequence 


211 


111 . 4 


4 . 


8 


2320 


6 


AX356964 


AX356964 Sequence 


212 


111 . 4 


4 . 


8 


2320 


9 


AF056188 


AF056188 Homo sapi 


213 


111 . 4 


4 . 


8 


2333 


9 


AY435144 


AY435144 Homo sapi 


214 


111.4 


4 . 


8 


2405 


9 


BC058844 


BC058844 Homo sapi 


215 


111.4 


4 . 


8 


2585 


6 


CQ796153 


CQ796153 Sequence 


216 


111.4 


4 . 


8 


2585 


6 


CQ796154 


CQ796154 Sequence 


217 


111.4 


4 . 


8 


2585 


6 


CQ796155 


CQ796155 Sequence 


218 


111 


4 . 


8 


1593 


9 


AF030310 


AF030310 Homo sapi 


219 


111 


4 . 


8 


1593 


9 


HSU89507 


U89507 Human UDP-g 


220 


111 


4 . 


.8^ 


.w. 2333 


9 


AY435143 


AY435143 Homo...sapi 


221 


111 


4 . 


8 


2363 


9 


AF462267 


AF462267 Homo sapi 


222 


111 


4 . 


8 


2394 


9 


AF462268 


AF462268 Homo sapi 


223 


109. 8 


4 . 


7 


2216 


10 


AY435135 


AY435135 Rattus no 


224 


109. 8 


4 . 


7 


2241 


10 


AF461735 


AF461735 Rattus no 


225 


109.6 


4 . 


7 


1655 


5 


PPL249081 


AJ249081 Pleuronec 


226 


109. 6 


4. 


7 


2287 


4 


AB018477 


AB018477 Ovis arie 


227 


109. 6 


4. 


7 


211178 


10 


AC139209 


AC139209 Mus muscu 


228 


109.2 


4. 


7 


241472 


5 


BX005027 


BX005027 Zebrafish 


c 229 


108.8 


4. 


7 


154733 


2 


AC127462 


AC127462 Danio rer 


c 230 


108.8 


4. 


7 


247221 


2 


BX470211 


BX470211 Danio rer 


231 


108.4 


4. 


7 


2284 


10 


AF461736 


AF461736 Rattus no 





o o o 

232 


iUo . z 


4 


. 7 


O 1 ft ^ 

2194 


10 


MUSUGTBR 




o o o 

233 


lOo . 2 


4 


. 7 


ft ft T C 

2215 


10 


S64760 




234 


TOO O 

108.2 


4 


. 7 


2224 


10 


AY227194 




o o c 

235 


108 


4 


. 7 


ft T ft ft 

2190 


10 


AY227200 




o o ^ 

23d 


108 


4 


. 7 


ft ft ft o 

2203 


1 ft 

10 


BC019434 




O O T 

237 


TOO 

108 


4 


. 7 


ft ft 1 c 

2215 


10 


AK128918 




o o o 

238 


1 ft ^ o 

105 . 8 


4 


. 6 


2231 


10 


AY435128 




239 


T ft ^ ft 

106 . 8 


4 


. 6 


2239 


10 


RNU20551 




240 


106 . 6 


4 


. 6 


2219 


10 


AY435134 




241 


106. 6 


4 


. 6 


2225 


10 


AY435130 




242 


1 ft ^ £ 
106 . 6 


4 


. 6 


2282 


10 


AF461738 




243 


106 . 6 


4 


. 6 


2285 


10 


BC078732 




244 


105 . 4 


4 


. 5 


2194 


10 


AB094481 




245 


105. 4 


4 


. 5 


2203 


10 


AY227196 




246 


105 . 2 


4 


. 5 


1968 


5 


PPUGTMRN 




247 


105 


4 


. 5 


1765 


10 


RATUDPGTA 




248 


104 . 8 


4 


. 5 


2155 


10 


MUSUGTP 




249 


104 . 8 


4 


. 5 


2236 


5 


AB120133 




250 


104 


4 


. 5 


151144 


5 


BX323575 




251 


103 . 8 


4 


. 5 


2219 


10 


AY435132 




252 


103 . 8 


4 


. 5 


2297 


10 


AF461734 




253 


103 . 4 


4 


. 5 


2037 


4 


OCU09101 




254 


103.2 


4 


. 4 


2193 


10 


AY227199 




255 


101 . 6 


4 


. 4 


1584 


6 


CQ586740 




256 


101 . 6 


4 


. 4 


1602 


10 


D87866 




257 


101 . 6 


4 


. 4 


1619 


3 


AY070934 




258 


101 . 6 


4 


. 4 


2211 


10 


AY227195 


c 


259 


101 . 6 


4 


. 4 


3650 


6 


CQ586739 


c 


260 


101 . 6 


4 


. 4 


164386 


3 


AC006491 


c 


261 


101 . 6 


4 


. 4 


210298 


3 


AE003690 




262 


100 • 8 


4 


. 3 


2332 


4 


AB018478 




263 


100 


4 


. 3 


1634 


10 


MMU09930 


c 


264 


100 


4 


. 3 


24145 


2 


AC014455 




265 


98 . 8 


4 


. 3 


1212 


6 


CQ597654 




266 


98 . 8 


4 


. 3 


1704 


'6 


CQ600372 




267 


98 . 8 


4 


. 3 


1980 


3 


BT006007 




268 


98 . 8 


4 


. 3 


1987 


3 


AY060891 


c 


269 


98 . 8 


4 


. 3 


3212 


6 


CQ597653 




270 


98 . 8 


4 


. 3 


4357 


6 


CQ600371 




271 


98 . 8 


4 


. 3 


24145 


2 


AC014455 




272 


98 . 8 


4 


. 3 


164386 


3 


AC006491 




273 


98 . 8 


4 


. 3 


210298 


3 


AE003690 




274 


98 . 4 


4 


.2 


2290 


10 


BC026561 




275 


98 . 4 


4 


. 2 


2303 


10 


BC069940 




276 


97 . 6 


4 


. 2 


1458 


6 


CQ590931 




277 


97 . 6 


4 


. 2 


3458 




..CQ590930 




278 


97 . 6 


4 


.2 


37688 


2 


AL513222 




279 


97 . 6 


4 


. 2 


37688 


3 


DMCEG0003 




280 


97 . 6 


4 


. 2 


95850 


2 


AC020063 




281 


97 . 6 


4 


.2 


154985 


3 


AC004641 


c 


282 


97 . 6 


4 


. 2 


160338 


3 


AC099021 


c 


283 


97 . 6 


4 


. 2 


272948 


3 


AE003805 




284 


97 


4 


.2 


1593 


10 


D83796 




285 


97 


4 


.2 


2216 


10 


AY435133 




286 


97 


4 


.2 


2343 


10 


AF461737 




287 


96.8 


4 


.2 


1596 


10 


D87867 




288 


96.8 


4 


.2 


2265 


10 


AY227197 



L02333 Murine bill 
S64760 UGTBrl=UDP- 
AY227194 Mus muscu 
.AY227200 Mus muscu 
BC019434 Mus muscu 
AK128918 Mus muscu 
AY435128 Rattus no 
U20551 Rattus norv 
AY435134 Rattus no 
AY435130 Rattus no 
AF461738 Rattus no 
BC078732 Rattus no 
AB094481 Mus muscu 
AY227196 Mus muscu 
X74116 P.platessa 
M34007 Rat bilirub 
L27122 Mus musculu 
AB120133 Pleuronec 
BX323575 Zebrafish 
AY435132 Rattus no 
AF461734 Rattus no 
U09101 Oryctolagus 

AY227199 Mus muscu 
CQ586740 Sequence 

D87866 Mus musculu 
AY070934 Drosophil 

AY227195 Mus muscu 
CQ586739 Sequence 
AC006491 Drosophil 
AE003690 Drosophil 
AB018478 Ovis arie 

U09930 Mus musculu 
AC014455 Drosophil 
CQ597654 Sequence 
CQ600372 Sequence 
BT006007 Drosophil 
AY060891 Drosophil 
CQ597653 Sequence 
CQ600371 Sequence 
AC014455 Drosophil 
AC006491 Drosophil 
AE003690 Drosophil 
BC026561 Mus muscu 
BC069940 Mus muscu 
CQ590931 Sequence 
CQ590930 Sequence 
AL513222 Drosophil 
AL031863 Drosophil 
AC020063 Drosophil 
AC004641 Drosophil 
AC099021 Drosophil 
AE003805 Drosophil 
D83796 Rat UGTl mR 
AY435133 Rattus no 
AF461737 Rattus no 
D87867 Mus musculu 
AY227197 Mus muscu 





289 


96. 8 


4 . 2 


2300 


10 


MMU16818 


U16818 Mus musculu 




290 


96 . 8 


4.2 


120848 


2 


AC007801 


AC007801 


Drosophil 




291 


95 . 6 


4 . 1 


735 


6 


AR252918 


AR252918 


Sequence 




292 


95 . 4 


4 . 1 


1927 


10 


RATUDPGT 


J02612 Rat UDP-glu 




293 


95 . 2 


4 . 1 


2248 


10 


AY227198 


AY227198 Mus muscu 


c 


294 


94 . 6 


4 . 1 


168062 


2 


AC012857 


AC012857 


Drosophil 




295 


94 . 6 


4 . 1 


172479 


3 


AC006590 


AC006590 


Drosophil 




296 


94 . 6 


4 . 1 


260027 


3 


AE003659 


AE003659 


Drosophil 




297 


94 


4 . 1 


2055 


4 


OCU09030 


U09030 Oryctolagus 




298 


93 . 4 


4 . 0 


1560 


6 


CQ586749 


CQ586749 


Sequence 


c 


299 


93.4 


4 . 0 


3828 


6 


CQ586748 


CQ586748 


Sequence 




300 


93 


4 . 0 


1652 


6 


CQ580647 


CQ580647 


Sequence 




301 


93 


4 . 0 


1766 


3 


AY070917 


AY070917 


Drosophil 


c 


302 


93 


4 . 0 


3991 


6 


CQ580646 


CQ580646 


Sequence 


c 


303 


93 


4.0 


14112 


6 


CQ592361 


CQ592361 


Sequence 


c 


304 


93 


4.0 


19914 


2 


AC018009 


AC018009 


Drosophil 




305 


93 


4 . 0 


177339 


3 


AC008231 


AC008231 


Drosophil 


c 


306 


93 


4.0 


305150 


3 


AE003453 


AE003453 


Drosophil 




307 


92.8 


4.0 


662 


6 


AX525627 


AX525627 


Sequence 




308 


91. 8 


4.0 


642 


6 


AX526169 


AX526169 


Sequence 




309 


91. 6 


3.9 


662 


6 


M525915 


AX525915 


Sequence 




310 


91.2 


3.9 


1662 


9 


BC053576 


BC053576 


Homo sapi 




311 


91 


3.9 


1648 


6 


CQ609717 


CQ609717 


Sequence 




312 


91 


3.9 


1692 


3 


AY118747 


AY118747 


Drosophil 




313 


88-8 


3.8 


1673 


3 


AY071256 


AY071256 


Drosophil 


c 


314 


88.8 


3.8 


5278 


6 


CQ611354 


CQ611354 


Sequence 


c 


315 


88. 8 


3.8 


77707 


2 


AC014787 


AC014787 


Drosophil 


c 


316 


88.8 


3.8 


177480 


3 


AC008287 


AC008287 


Drosophil 


c 


317 


88.8 


3.8 


225974 


3 


AE003778 


AE003778 


Drosophil 




318 


88.2 


3.8 


2484 


10 


RATUGT 


J05132 Rat 3-methy 




319 


87.4 


3.8 


1595 


6 


CQ586224 


CQ586224 


Sequence 




320 


86.2 


3.7 


874 


6 


AX421573 


AX421573 


Sequence 




321 


85.2 


3.7 


3716 


6 


CQ614606 


CQ614606 


Sequence 




322 


82.6 


3.6 


1572 


6 


CQ612552 


CQ612552 


Sequence 




323 


82.6 


3.6 


3572 


6 


CQ612551 


CQ612551 


Sequence 




324 


82. 6 


3.6 


60144 


2 


AC019798 


AC019798 


Drosophil 


c 


325 


82^6 


3.6 


91685 


2 


AC005121 


AC005121 


Drosophil 




326 


82.6 


3.6 


175413 


3 


AC093097 


AC093097 


Drosophil 




327 


82.6 


3.6 


262525 


3 


AE003652 


AE003652 


Drosophil 


c 


328 


80.8 


3.5 


4016 


6 


CQ586223 


CQ586223 


Sequence 


c 


329 


80.8 


3.5 


163325 


3 


AC008309 


AC008309 


Drosophil 


c 


330 


80. 8 


3.5 


167475 


3 


AC016132 


AC016132 


Drosophil 




331 


80.8 


3.5 


220936 


2 


AC020256 


AC020256 


Drosophil 


c 


332 


80.8 


3.5 


241480 


3 


AE003734 


AE003734 


Drosophil 




333 


80.6 


3.5 


1593 


6 


CQ613314 


CQ613314 


Sequence 




334 


80.6 


3.5 


3659 


6 


CQ613313.-... 


CQ613313 


Sequence 




335 


80.6 


3.5 


174677 


3 


AC018479 


AC018479 


Drosophil 




336 


80.6 


3.5 


193248 


3 


AC007752 


AC007752 


Drosophil 




337 


80.6 


3.5 


221239 


3 


AE003697 


AE003697 


Drosophil 


c 


338 


80. 6 


3.5 


324288 


2 


AC017336 


AC017336 


Drosophil 




339 


80 


3.4 


170142 


9 


AP002383 


AP002383 


Homo sapi 




340 


80 


3.4 


174742 


2 


AC016896 


AC016896 


Homo sapi 




o41 


80 


3 . 4 




2 


AC032024 


AC032024 


Homo sapi 




342 


79.4 


3.4 


1413 


6 


CQ611790 


CQ611790 


Sequence 




343 


79.4 


3.4 


2820 


6 


CQ611355 


CQ611355 


Sequence 




344 


79.2 


3.4 


7218 


6 


166494 


166494 Sequence 14 




345 


78.8 


3.4 


804 


6 


CQ727942 


CQ727942 


Sequence 





346 


78. 8 


3. 


4 


1988 


3 


AY069392 


AY069392 


Drosophil 




347 


78 . 8 


3. 


4 


2437 


3 


AY071467 


AY071467 


Drosophil 




348 


78 . 6 


3. 


4 


678 


6 


AX526170 


AX526170 


Sequence 


c 


349 


78.6 


3. 


4 


235952 


2 


ACllOlOl 


ACllOlOl 


Rattus no 


c 


350. 


78.6 


3. 


4 


256974 


2 


AC095475 


AC095475 


Rattus no 


c 


351 


78 


3. 


4 


195466 


2 


AC032024 


AC032024 


Homo sapi 




352 


77.8 


3. 


4 


5205 


6 


CQ609716 


CQ609716 


Sequence 


c 


353 


77. 8 


3. 


4 


128469 


2 


AC020029 


AC020029 


Drosophil 




354 


77.8 


3. 


4 


172061 


3 


AC009203 


AC009203 


Drosophil 




355 


77.8 


3. 


4 


232737 


3 


AE003660 


AE003660 


Drosophil 




356 


77.2 


3. 


3 


189 


6 


CQ740741 


CQ740741 


Sequence 




357 


77 . 


3. 


3 


1352 


10 


MMGCGTEXl 


X92123 M.musculus 




358 


77 


3. 


3 


2023 


10 


MMCGT2 


U48892 Mus musculu 




359 


77 


3. 


3 


202752 


2 


AC111106 


AC111106 


Mus muscu 




360 


76.6 


3. 


3 


696 


6 


CQ734321 


CQ734321 


Sequence 




361 


76.4 


3. 


3 


169056 


2 


CR381587 


CR381587 


Danio rer 




362 


74.6 


3. 


2 


603 


6 


AX401758 


AX401758 


Sequence 




363 


74.6 


3. 


2 


603 


10 


S56937 


S56937 3-methylcho 




364 


74 


3. 


2 


582 


6 


CQ458675 


CQ458675 


Sequence 




365 


73.8 


3. 


2 


4389 


2 


AC014898 


AC014898 


Drosophil 


c 


366 


73.8 


3. 


2 


171334 


3 


AC007477 


AC007477 


Drosophil 


c 


367 


73.8 


3. 


2 


196516 


3 


AC013450 


AC013450 


Drosophil 




368 


73.8 


3. 


2 


280240 


3 


AE003786 


AE003786 


Drosophil 


c 


369 


73.2 


3. 


2 


237028 


2 


BX000470 


BX000470 


Danio rer 




370 


72.2 


3. 


1 


1443 


9 


HSCGTOl 


U31353 Human UDP-G 




371 


72.2 


3. 


1 


149028 


9 


AC122938 


AC122938 


Homo sapi 




372 


72 


3. 


1 


293 


6 


CQ427045 


CQ427045 


Sequence 




373 


72 


"3. 


1 


439 


6 


CQ417459 


CQ417459 


Sequence 




374 


71.2 


3. 


1 


1681 


3 


AF324465 


AF324465 


Bombyx mo 




375 


70.8 


3. 


1 


601 


6 


AX401757 


AX401757 


Sequence 




376 


70. 8 


3. 


1 


601 


10 


S56936 


S56936 bilirubin-s 




377 


70.2 


3. 


0 


246176 


2 


AC114845 


AC114845 


Rattus no 




378 


69.4 


3. 


0 


1680 


6 


CQ586665 


CQ586665 


Sequence 




379 


69.4 


3. 


0 


1781 


3 


AY051661 


AY051661 


Drosophil 




380 


69.4 


3. 


0 


1782 


6 


CQ586689 


CQ586689 


Sequence 


c 


381 


69.4 


3. 


0 


3741 


6 


CQ586664 


CQ586664 


Sequence 


c 


382 


69.4 


3. 


0 


3849 


6 


CQ586688 


CQ586688 


Sequence 


c 


383 


69.4 


3. 


0 


30221 


2 


AC014454 


AC014454 


Drosophil 


c 


384 


69.4 


3. 


0 


165136 


3 


AC007645 


AC007645 


Drosophil 




385 


69.2 


3. 


0 


1491 


6 


CQ581757 


CQ581757 


Sequence 




386 


69.2 


3. 


0 


3784 


6 


CQ581756 


CQ581756 


Sequence 


c 


387 


68.6 


3. 


0 


3595 


6 


CQ613379 


CQ613379 


Sequence 


c 


388 


68.6 


3. 


0 


174677 


3 


AC018479 


AC018479 


Drosophil 


c 


389 


68.6 


3. 


0 


193248 


3 


AC007752 


AC007752 


Drosophil 


c 


390 


68.6 


3. 


0 


221239 


3 


AE003697 


AE003697 


Drosophil 


c 


391 


68 . 6 




0 


246724 


2 


AC131807 


AC13180X 


Rattus no 




392 


68.6 


3. 


0 


324288 


2 


AC017336 


AC017336 


Drosophil 




393 


68.2 


2. 


9 


769 


6 


AR497103 


AR497103 


Sequence 




394 


68.2 


2. 


9 


769 


6 


AR512385 


AR512385 


Sequence 




395 


68 


2. 


9 


1539 


6 


CQ613380 


CQ613380 


Sequence 




396 


67.8 


2. 


9 


2972 


14 


AF373031 


AF373031 Epinotia 


c 


397 


67.4 


2. 


9 


224097 


2 • 


AC117901 


AC117901 


Rattus no 


c 


398 


67.4 


2. 


9 


251769 


2 


AC117913 


AC117913 


Rattus no 




399 


66.6 


2. 


9 


1793 


3 


AF116555 


AF116555 


Drosophil 




400 


66.4 


2. 


9 


11728 


10 


RATUDPGV 


M74439 Rattus norv 


c 


401 


66.4 


2. 


9 


138578 


2 


AC141352 


AC141352 


Rattus no 




402 


66.4 . 


2. 


9 


146839 


2 


AC142006 


AC142006 


Rattus no 





403 


66 


. 2 


2 . 


9 


1931 


6 


CQ609720 


CQ609720 


Sequence 




404 


66 


. 2 


2 . 


9 


1952 


3 


AY051629 


AY051629 


Drosophil 




405 




66 


2 . 


8 


1188 


6 


CQ597045 


CQ597045 


Sequence 


c 


406 




66 


2 . 


8 


3188 


6 


CQ597044 


CQ597044 


Sequence 




407 




66 


2. 


8 


42006 


2 


AC018022 


AC018022 


Drosophil 


c 


408 




66 


2 . 


8 


162923 


3 


AC008236 


AC008236 


Drosophil 


c 


409 




66 


2 . 


8 


210258 


3 


AE003743 


AE003743 


Drosophil 




410 


65 


. 8 


2 . 


8 


1672 


6 


CQ598692 


CQ598692 


Sequence 




411 


65 


. 8 


2 . 


8 


1946 


3 


AY070939 


AY070939 


Drosophil 




412 


65 


. 8 


2 . 


8 


3751 


6 


CQ598691 


CQ598691 


Sequence 


c 


413 


65 


. 8 


2. 


8 


170362 


3 


AC009250 


AC009250 


Drosophil 




414 


65 


. 6 


2 . 


8 


179810 


9 


AC139345 


AC139345 


Papio anu 




415 


65 


. 4 


2 . 


8 


1488 


6 


CQ597786 


CQ597786 


Sequence 




416 


65 


. 4 


2 . 


8 


1584 


6 


CQ609723 


CQ609723 


Sequence 




417 


65 


. 4 


2 . 


8 


1920 


3 


AY069532 


AY069532 


Drosophil 




418 


65 


.4 


2, 


8 


1977 


6 


CQ729719 


CQ729719 


Sequence 




419 


65 


. 4 


2. 


8 


3613 


6 


CQ594473 


CQ594473 


Sequence 




420 


65 


.4 


2. 


8 


4082 


6 


CQ597785 


CQ597785 


Sequence 




421 


65 


.4 


2. 


8 


4082 


6 


CQ609722 


CQ609722 


Sequence 




422 


65 


. 4 


2 . 


8 


leiiii 


3 


AC009204 


AC009204 


Drosophil 




423 


64 


.8 


2 . 


8 


2216 


10 


RATUDPB 


M33747 Rat UDP glu 




424 


64 


. 8 


2. 


8 


150430 


10 


AC133491 


AC1334 91 Rattus no 


c 


425 


64 


. 8 


2. 


8 


150430 


10 


AC133491 


AC133491 Rattus no 


c 


426 


64 


. 8 


2 . 


8 


246176 


2 


AC114845 


AC114845 


Rattus no 




427 


64 


. 8 


2 . 


8 


246724 


2 


AC131807 


AC131807 


Rattus no 


c 


428 


64 


. 6 


2 . 


8 


1369 


6 


AR508827 


AR508827 


Sequence 




429 


64 


. 6 


2. 


8 


1623 


6 


CQ593769 


CQ593769 


Sequence 


c 


430 


64 


. 6 


2. 


8 


3683 


6 


CQ593768 


CQ593768 


Sequence 




431 


63 


. 8 


2 . 


8 


500 


6 


CQ071564 


CQ071564 


Sequence 




432 


63 


. 8 


2 . 


8 


500 


6 


CQ102164 


CQ102164 


Sequence 




433 


63 


. 8 


2. 


8 


500 


6 


CQ141114 


CQ141114 


Sequence 




434 


63 


.8 


2. 


8 


500 


6 


CQ176767 


CQ176767 


Sequence 




435 


63 


. 8 


2 . 


8 


500 


6 


CQ224383 


CQ224383 


Sequence 




436 


63 


. 8 


2 . 


8 


500 


6 


CQ262401 


CQ262401 


Sequence 




437 


63 


. 8 


2 . 


8 


500 


6 


CQ336589 


CQ336589 


Sequence 




438 


63 


. 8 


2 . 


8 


689 


6 


BD229133 


BD229133 


Genotype 




439 


63 


. 8 


2. 


8 


689 


6 


AR349385 


AR349385 


Sequence 




440 


63 


. 8 


2. 


8 


1521 


6 


CQ592386 


CQ592386 


Sequence 




441 


63 


.8 


2. 


8 


3576 


6 


CQ592385 


CQ592385 


Sequence 


c 


442 


63 


. 8 


2. 


8 


27310 


9 


AF135416 


AF135416 


Homo sapi 


c 


443 


63 


. 8 


2. 


8 


103343 


9 


AC108078 


AC108078 


Homo sapi 


c 


444 


63 


. 8 


2. 


8 


138232 


2 


AC012582 


AC012582 


Homo sapi 


c 


445 


63 


.8 


2. 


8 


226077 


2 


AC013296 


AC013296 


Homo sapi 




446 


63 


. 6 


2. 


7 


145299 


5 


BX005348 


BX005348 


Zebraf ish 




447 


63 


. 6 


2. 


7 


229237 


5 


BX323548 


BX323548 


Zebraf ish 




448 


63 


. 6 


2 . 


7 


238195 


2 


^CR790368 


CR790368 


Danio rer 




449 


63 


.4 


2. 


7 


1367 


9 


AY529122 


AY529122 


Homo sapi 




450 


63 


. 4 


2. 


7 


1755 


3 


AY071432 


AY071432 


Drosophil 




451 


63 


. 4 


2 . 


7 


21000 


6 


AX395166 


AX395166 


Sequence 


c 


452 


63 


. 4 


2. 


7 


132963 


2 


AC017518 


AC017518 


Drosophil 




453 


63 


. 4 


2. 


7 


146309 


2 


CR388385 


GR388385 


Homo sapi 




454 


63 


. 4 


2. 


7 


173723 


3 


AC009351 


AC009351 


Drosophil 




455 


63 


. 4 


2. 


7 


217991 


9 


AC021146 


AC021146 


Homo sapi 




456 


63 


.4 


2. 


7 


280096 


3 


AE003613 


AE003613 


Drosophil 




457 


63 


.2 


2. 


7 


1047 


6 


CQ733586 


CQ733586 


Sequence 




458 


62 


.6 


2. 


7 


521 


10 


MMCGT5 


U48895 Mus musculu 




459 


62 


.6 


2. 


7 


156225 


2 


AC115007 


AC115007 


Mus muscu 





460 


62 . 6 


2. 


7 


205079 


10 


AC118544 




461 


62.6 


2. 


7 


240490 


2 


AC100269 




462 


62 . 4 


2. 


7 


391 


6 


BD237020 




463 


62.4 


2. 


7 


391 


6 


AR225420 




4 64 


62 . 4 


2. 


7 


391 


6 


AR562843 




465 


62 . 4 


2. 


7 


391 


6 


AX321490 




466 


62.2 


2. 


7 


1443 


6 


CQ595215 




467 


62.2 


2. 


7 


1591 


6 


BD229170 




468 


62.2 


2. 


7 


1591 


6 


AR349422 


c 


469 


62.2 


2. 


7 


3495 


6 


CQ595214 


c 


470 


62.2 


2. 


7 


34551 


2 


AC017171 


c 


471 


62.2 


2. 


7 


78938 


3 


AC002503 




472 


62.2 


2. 


7 


104079 


9 


AC107401 




473 


62.2 


2. 


7 


156578 


9 


AClllOOO 


c 


474 


62.2 


2. 


7 


174735 


3 


AC006402 


c 


475 


62.2 


2. 


7 


182901 


3 


AC093198 


c 


476 


62.2 


2. 


7- 


318278 


3 


AE003666 




477 


62 


2. 


7 


1543 


9 


AF177273 




478 


62 


2. 


7 


2301 


10 


RNU75903 




479 


62 


2. 


7 


209071 


3 


AC005285 




480 


61.8 


2. 


7 


1527 


6 


CQ580845 




481 


61.8 


2. 


7 


1757 


3 


AY128460 


c 


482 


61.8 


2. 


7 


4647 


6 


CQ580844 


c 


483 


61.8 


2. 


7 


78277 


2 


AC020206 




484 


61.8 


2. 


7 


81677 


3 


AC004377 


c 


485 


61.8 


2. 


7 


169534 


3 


AC008350 


c 


486 


61.8 


2. 


7 


302225 


3 


AE003458 




487 


61.6 


2, 


7 


29546 


3 


AY246561 


c 


488 


61.4 


2. 


6 


3679 


6 


CQ593771 




489 


61 


2. 


6 


851 


11 


BV017252 




490 


60.8 


2. 


6 


1692 


8 


AK109806 


c 


49i 


60.8 


2. 


6 


141545 


8 


OSJN00221 


c 


492 


60. 6 


2. 


6 


43323 


9 


AC114797 




493 


60.4 


2. 


6 


1500 


6 


CQ614607 




494 


60 


2. 


6 


220 


6 


CQ080780 




495 


60 


2. 


6 


220 


6 


CQ115243 




496 


60 


2. 


6 


220 


6 


CQ154074 




497 


60 


2. 


6 


220 


6 


CQ186655 




498 


60 


2. 


6 


220 


6 


CQ237309 




499 


60 


2. 


6 


220 


6 


CQ274933 




500 


60 


2. 


6 


220 


6 


CQ349280 




501 


60 


2. 


6 


402 


10 


MMGCGTEX4 




502 


59.6 


2. 


6 


1668 


6 


CQ586722 




503 


59.6 


2. 


6 


1676 


3 


AY051442 


c 


504 


59.6 


2. 


6 


3740 


6 


CQ586721 


c 


505 


59.6 


2. 


6 


110907 


14 


AY229987 




506 


59.4 


2. 


6 


1606 


6 


AX675577 




507 


59.4 


2. 


6 


1606 


6 


AX921811 




508 


59. 4 


2. 


6 


176321 


2 


BX942829 




509 


59.4 


2. 


6 


186564 


10 


AC119816 




510 


59.2 


2. 


6 


427 


9 


HSCGT04 


c 


511 


59 


2. 


5 


589 


6 


CQ225338 


c 


512 


59 


2. 


5 


589 


6 


CQ263337 




513 


59 


2. 


5 


103343 


9 


AC108078 


c 


514 


59 


2. 


5 


217991 


9 


AC021146 




515 


58.6 


2. 


5 


1648 


8 


AK064151 




516 


58.6 


2. 


5 


141040 


8 


AP004382 



AC118544 Mus muscu 
AC100269 Mus muscu 
BD237020 Compounds 
AR225420 Sequence 
AR562843 Sequence 
AX321490 Sequence 
CQ595215 Sequence 
BD229170 Genotype 
AR349422 Sequence 
CQ595214 Sequence 
AC017171 Drosophil 
AC002503 Drosophil 
AC107401 Homo sapi 
AClllOOO Homo sapi 
AC006402 Drosophil 
AC093198 Drosophil 
AE003666 Drosophil 
AF177273 Homo sapi 

U75903 Rattus norv 
AC005285 Drosophil 
CQ580845 Sequence 
AY1284 60 Drosophil 
CQ580844 Sequence 
AC020206 Drosophil 
AC004377 Drosophil 
AC008350 Drosophil 
AE003458 Drosophil 
AY246561 Branchios 
CQ593771 Sequence 

BV017252 S212P6006 
AK109806 Oryza sat 
AL663019 Oryza sat 
AC114797 Homo sapi 
CQ614607 Sequence 
CQ080780 Sequence 
CQ115243 Sequence 
CQ154074 Sequence 
CQ186655 Sequence 
CQ237309 Sequence 
CQ274933 Sequence 
CQ349280 Sequence 

X92126 M.musculus 
CQ586722 Sequence 
AY051442 Drosophil 
CQ586721 Sequence 

AY229987 Cryptophl 
AX675577 Sequence 
AX921811 Sequence 
BX942829 Danio rer 

AC119816 Mus muscu 
U31861 Human UDP-g 
CQ225338 Sequence 
CQ2 63337 Sequence 
AC108078 Homo sapi 
AC021146 Homo sapi 
AK064151 Oryza sat 
AP004382 Oryza sat 





517 


58 


2 . 


5 


609 


10 


RATUD2A07 


M35083 Rat UDP glu 




518 


58 


2 . 


5 


1704 


3 


AF116554 


AF116554 Drosophil 




519 


58 


2. 


5 


247575 


2 


AC125632 


AC125632 Rattus no 




520 


57 . 8 


2 . 


5 


2108 


14 


LOGVEGT 


Y08294 Lacanobia o 


c 


521 


57 . 8 


2 . 


5 


145253 


10 


AC138173 


AC138173 Mus muscu 


c 


522 


57. 8 


2 . 


5 


186564 


10 


AC119816 


AC119816 Mus muscu 


c 


523 


57.4 


2. 


5 


831 


6 


AR524865 


AR524865 Sequence 




524 


57.4 


2. 


5 


2036 


6 


CQ581913 


CQ581913 Sequence 




525 


57.4 


2 . 


5 


4386 


6 


CQ581912 


CQ581912 Sequence 


c 


526 


57.4 


2. 


5 


63705 


9 


AF135419 


AF135419 sHomo sap 




527 


57.4 


2. 


5 


67568 


2 


AC020378 


AC020378 Drosophil 


c 


528 


57. 4 


2. 


5 


123406 


8 


AC120508 


AC120508 Oryza sat 




529 


57.4 


2. 


5 


148977 


9 


AC114786 


AC114786 Homo sapi 




530 


57.2 


2. 


5 


149712 


2 


AC140853 


AC140853 Canis fam 


c 


531 


57.2 


2. 


5 


149712 


2 


AC140853 


AC140853 Canis fam 




532 


57.2 


2. 


5 


161844 


2 


AC147451 


AC147451 Otolemur 


c 


533 


57.2 


2. 


5 


208667 


10 


AC101835 


AC101835 Mus muscu 




534 


57 


2. 


5 


1665 


8 


AB009370 


7VB0 09370 Vigna mun 




535 


56.8 


2. 


4 


1416 


8 


MZEIAGLU 


L34847 Zea mays lA 




536 


56.8 


2. 


4 


1731 


6 


AR260578 


AR260578 Sequence 




537 


56.8 


2. 


4 


161046 


14 


AF081810 


AF081810 Lymantria 




538 


56.8 


2. 


4 


183755 . 


2 


AC146742 


AC146742 Papio anu 


c 


539 


56.8 


2. 


4 


183807 


9 


AC091778 


AC091778 Papio anu 


c 


540 


56.6 


2. 


4 


52742 


2 


AC147071 


AC147071 Homo sapi 


c 


541 


56. 6 


2. 


4 


96776 


9 


AC134921 


AC134921 Homo sapi 




542 


56.6 


2. 


4 


165501 


2 


AC147569 


AC147569 Homo sapi 


c 


543 


56. 6 


2. 


4 


184963 


9 


AC147055 


AC147055 Homo sapi 




544 


56.6 


2. 


4 


232016 


2 


AC148029 


AC148029 Homo sapi 




545 


56.2 


2. 


4 


513 


6 


AR509041 


AR509041 Sequence 




546 


56.2 


2. 


4 


983 


6 


AR284336 


AR284336 Sequence 




547 


56.2 


2. 


4 


4966 


6 


CQ609719 


CQ609719 Sequence 


c 


548 


56.2 


2. 


4 


160256 


2 


CR753888 


CR753888 Danio rer 




549 


56.2 


2. 


4 


183498 


5 


BX072578 


BX072578 Zebrafish 


c 


550 


56 


2. 


4 


183498 


5 


BX072578 


BX072578 Zebrafish 




551 


55.8 


2. 


4 


350 


6 


AX887421 


AX887421 Sequence 




552 


55.8 


2. 


4 


350 


6 


BD027031 


BD027031 Sequence 


c 


553 


55.6 


2. 


4 


106525 


9 


AC093720 


AC093720 Homo sapi 




554 


55.6 


2. 


4 


147009 


2 


AC108490 


AC108490 Homo sapi 




555 


55.6 


2. 


4 


153296 


9 


AC147070 


AC147070 Homo sapi 




556 


55.6 


2. 


4 


155175 


2 


AC147064 


AC147064 Homo sapi 




557 


55.6 


2. 


4 


181117 


9 


AC019173 


AC019173 Homo sapi 


c 


558 


55.4 


2. 


4 


590 


11 


BV100588 


BV100588 RPAMMSEQO 


c 


559 - 


55.2 


2. 


4 


203 


6 


CQ238174 


CQ238174 Sequence 


c 


560 


55.2 


2. 


4 


203 


6 


CQ275780 


CQ275780 Sequence 




561 


55 


2. 


4 


1602 


6 


BD229242 


BD229242 Genotype 


-tut 


.562 


55 


2. 


4 


1602 


6 


AR349494 


AR349494 Sequence 




563 


55 


2. 


4 


106525 


9 


AC093720 


AC093720 Homo sapi 




564 


54.6 


2. 


4 


.848 


6 


CQ722622 


CQ722622 Sequence 




565 


54.4 


2. 


3 


5122 


14 


AY250076 


AY250076 Spodopter 




566 


54.4 


2. 


3 


6120 


8 


AF331855 


AF331855 Zea mays 




567 


54.2 


2. 


3 


1933 


9 


AK130038 


AK130038 Homo sapi 


c 


568 


54 .2 


2. 


3 


3642 


6 


CQ611789 


CQ611789 Sequence 




569 


54 


2. 


3 


245842 


2 


AC106307 


AC106307 Rattus no 




570 


53.8 


2. 


3 


138232 


2 


AC012582 


AC012582 Homo sapi 


c 


571 


53.8 


2. 


3 


169483 


9 


AC093829 


AC093829 Homo sapi 




572 


53.8 


2. 


3 


228873 


2 


AC134364 


AC134364 Rattus no 


c 


573 


53.8 


2. 


3 


231284 


2 


AC095374 


AC095374 Rattus no 





574 


53. 6 


2 


.3 


1536 


8 


AB027454 


AB027454 


Petunia x 


c 


575 


53.4 


2 


. 3 


583 


11 


BV100584 


BV100584 RPAMMSEQO 


c 


576 


53.4 


2 


.3 


583 


11 


BV163609 


BVl 63609 RPAMMSEQO 




577 


53.4 


2 


.3 


784 


6 


AX356966 


AX356966 


Sequence 




578 


53.4 


2 


.3 


784 


9 


HUMUGT02 


M84123 Human UDP-g 




579 


53.4 


2 


.3 


1620 


6 


CQ593772 


CQ593772 


Sequence 




580 


53.4 


2 


.3 


16944 


9 


AY603772 


AY603772 


Homo sapi 


c 


581 


53.4 


2 


.3 


23787 


8 


AP005917 


AP005917 


Oryza sat 


c 


582 


53.4 


2 


.3 


68770 


9 


AC006985 


AC006985 


Homo sapi 


c 


583 


53.4 


2 


.3 


176619 


2 


AC026497 


AC026497 


Homo sapi 




584 


53.4 


2 


.3 


198872 


9 


AF297093 


AF297093 


Homo sapi 




585 


53.2 


2 


.3 


1730 


8 


AB070746 


AB070746 


Vigna ang 




586 


53 


2 


.3 


1813 


8 


AY257207 


AY257207 


Ipomoea t 




587 


52.8 


2 


.3 


125020 


9 


AF429315 


AF429315 


Homo sapi 


c 


588 


52.6 


2 


.3 


82746 


1 


AF453501 


AF453501 


Actinosyn 




589 


52.4 


2 


.3 


209586 


2 


AC148497 


AC148497 


Otolemur 




590 


52 


2 


.2 


707 


6 


CQ492835 


CQ492835 


Sequence 




591 


52 


2 


.2 


90972 


8 


AP006584 


AP006584 


Oryza sat 


c 


592 


52 


2 


.2 


122193 


2 


AP006053 


AP006053 


Oryza sat 




593 


51.8 


2 


.2 


1584 


8 


AB103471 


AB103471 


Aralia co 


c 


594 


51.6 


2 


.2 


576 


11 


BV163611 


BV163611 RPAMMSEQO 




595 


51.6 


2 


.2 


1594 


8 


AB191247 


AB191247 


Dianthus 




596 


51.6 


2 


.2 


105931 


2 


AC005427 


AC005427 


Drosophil 


c 


597 


51.6 


2 


.2 


176321 


2 


BX942829 


BX942829 


Danio rer 




598 


51.4 


2 


.2 


1490 


8 


AY257208 


AY257208 


Ipomoea n 




599 


51.4 


2 


.2 


1786 


8 


AB038248 


AB038248 


Ipomoea b 


c 


600 


51.4 


2 


.2 


94516 


8 


AP004526 


AP004526 


Lotus cor 




601 


51 


2 


.2 


1378 


8 


AB070743 


AB070743 


Vigna ang 


c 


602 


51 


2 


.2 


1640 


8 


AK099344 


AK099344 


Oryza sat 




603 


51 


2 


.2 


1686 


8 


AK106005 


AK106005 


Oryza sat 




604 


51 


2 


.2 


1709 


8 


AK105785 


AK105785 


Oryza sat 


c 


605 


51 


2 


.2 


1747 


8 


AK064324 


AK064324 


Oryza sat 


c 


606 


51 


2 


.2 


1790 


8 


AK101518 


AK101518 


Oryza sat 




607 


51 


2 


.2 


1862 


8 


AK100744 


AK100744 


Oryza sat 




608 


51 


2 


.2 


124591 


8 


AC118347 


AC118347 


Oryza sat 


c 


609 


51 


2 


.2 


146154 


2 


AP004400 


AP004400 


Oryza sat 


c 


610 


51 


2 


.2 


156069 


8 


AC136842 


AC136842 


Oryza sat 




611 


51 


2 


.2 


183095 


8 


AP005186 


AP005186 


Oryza sat 




612 


50.8 


2 


.2 


1545 


8 


AB191245 


AB191245 


Dianthus 


c 


613 


50.8 


2 


.2 


8596 


14 


AF527603 


AF527603 Spodopter 


c 


614 


50.8 


2 


.2 


38383 


3 


CEF35H8 


Z36752 Caenorhabdi 


c 


615 


50.8 


2 


.2 


146839 


2 


AC142006 


AC142006 


Rattus no 


c 


616 


50.6 


2 


.2 


1105 


8 


AK060146 


AK060146 


Oryza sat 


c 


617 


50.6 


2 


.2 


124366 


8 


AC131374 


AC131374 


Oryza sat 


c 


618 


50.6 


2 


.2 


300029 


8 


AE017077 


AE017077 


Oryza sat 




619 


50...4 


2 


.2 


1391 


6 


CQ774188 


CQ77J11.88 


Sequence 




620 


50.4 


2 


.2 


1494 


6 


AX652909 


AX652909 


Sequence 




621 


50.4 


2 


.2 


1733 


8 


AK106250 


AK106250 


Oryza sat 




622 


50.4 


2 


.2 


1795 


8 


AK066593 


AK066593 


Oryza sat 




623 


50.4 


2 


.2 


1999 


6 


AX756082 


AX756082 


Sequence 




624 


50.4 


2 


.2 


1999 


8 


AK066469 


AK066469 


Oryza sat 




625 


50.4 


2 


.2 


131616 


6 


CQ774045 


CQ774045 


Sequence 


c 


626 


50. 4 


2 


.2 


131680 


14 


AY522332 


AY522332 Agrotis s 


c 


627 


50.4 


2 


.2 


147472 


8 


AP003974 


AP003974 


Oryza sat 


c 


628 


50.4 


2 


.2 


147739 


8 


AP005915 


AP005915 


Oryza sat 




629 


50.2 


2 


.2 


1577 


8 


AK107669 


AK107669 


Oryza sat 




630 


50.2 


2 


.2 


1662 


8 


AB070754 


AB070754 


Vigna ang 





631 


50.2 


2 . 


2 


1843 


8 


AK065112 


AK065112 Oryza sat 




632 


50.2 


2. 


2 


58309 


8 


AP006173 


AP006173 Oryza sat 


c 


633 


50.2 


2. 


2 


185449 


8 


AC123526 


AC123526 Oryza sat 




634 


49.8 


2. 


1 


1086 


8 


AY257215 


AY257215 Ipomoea a 




635 


49.8 


2 . 


1 


6322 


8 


AF331854 


AF331854 Zea mays 


c 


636 


49. 8 


2 . 


1 


140023 


4 


AC093453 


AGO 934 53 Canis fam 




637 


49.6 


2. 


1 


563 


6 


CQ505416 


CQ505416 Sequence 




638 


49.4 


2. 


1 


1578 


6 


E12713 


E12713 Solanum mel 




639 


49.4 


2. 


1 


1578 


8 


SMGT 


X77369 S.melongena 


c 


640 


49.4 


2. 


1 


2850 


14 


SLPVEGT 


X99073 Spodoptera 




641 


49.2 


2. 


1 


656 


6 


AX201693 


AX201693 Sequence 




642 


49.2 


2 . 


1 


657 


6 


AR307005 


AR307005 Sequence 




643 


49.2 


2 . 


1 


657 


6 


AX154755 


AX154755 Sequence 




644 


49.2 


2. 


1 


1109 


3 


AY069046 


AY069046 Drosophil 




645 


49.2 


2i 


1 


1530 


6 


CQ612555 


CQ612555 Sequence 




646 


49.2 


2. 


1 


1752 


6 


CQ600621 


CQ600621 Sequence 




647 


49.2 


2. 


1 


3679 


6 


CQ612554 


CQ612554 Sequence 


c 


648 


49.2 


2. 


1 


5189 


14 


SLAJ3131 


AJ003131 Spodopter 




649 


49 


2. 


1 


501 


6 


CQ475453 


CQ475453 Sequence 




650 


49 


2. 


1 


18314 


3 


AF125959 


AF125959 Caenorhab 




651 


49 


2. 


1 


158826 


8 


AP003607 


AP003607 Oryza sat 


c 


652 


49 


2. 


1 


175667 


8 


AC119147 


AC119147 Genomic s 


c 


653 


49 


2. 


1 


189057 


2 


BX957337 


BX957337 Danio rer 




654 


48.8 


2. 


1 


1141 


6 


AX083744 


AX083744 Sequence 




655 


48.8 


2. 


1 


1496 


8 


AY262037 


AY262037 Crocus sa 




656 


48.8 


2. 


1 


1689 


14 


LDU04321 


U04321 Lymantria d 


c 


657 


48.6 


2. 


1 


666 


6 


M660993 


AX660993 Sequence 




658 


48.6 


2. 


1 


1001 


6 


AR284354 


AR284354 Sequence 




659 


48. 6 


2. 


1 


1501 


8 


AK059016 


AK059016 Oryza sat 




660 


48.6 


2. 


1 


1684 


8 


AB070755 


AB070755 Vigna ang 


c 


661 


48.6 


2. 


1 


2000 


6 


AX655393 


AX655393 Sequence 




662 


48.6 


2. 


1 


2031 


8 


AK067710 


AK067710 Oryza sat 




663 


48.4 


2. 


1 


789 


6 


AX660822 


AX660822 Sequence 




664 


48.4 


2. 


1 


1437 


6 


AX505505 


AX505505 Sequence 




665 


48.4 


2. 


1 


1455 


6 


AX654659 


AX654659 Sequence 




666 


48.4 


2. 


1 


1464 


8 


BT015770 


BT015770 Arabidops 




667 


48.4 


2. 


1 


1724 


8 


AY049277 


AY049277 Arabidops 




668 


48.4 


2. 


1 


1727 


8 


AK071819 


AK071819 Oryza sat 




669 


48.4 


2. 


1 


1781 


14 


MBU41999 


U41999 Mamestra br 




670 


48.4 


2. 


1 


2396 


6 


E25788 


E25788 Ecdysteroid 


c 


671 


48.4 


2. 


1 


84196 


8 


ATT3A5 


AL132979 Arabidops 


c 


672 


48.4 


2. 


1 


89469 


8 


ATF18B3 


AL049862 Arabidops 


c 


673 


48.4 


2. 


1 


110086 


8 


AP003373 


AP003373 Oryza sat 




674 


48.4 


2. 


1 


153656 


14 


AF539999 


AF539999 Mamestra 




675 


48.4 


2 . 


1 


155060 


14 


MBU594 61 


U59461 Mamestra co 




676 


48.2 


2. 


1 


,«..ai59 


8 


AY257209 


AY257209 Ipomoea,...p 




677 


48.2 


2 . 


1 


1346 


8 


AY257206 


AY257206 Ipomoea h 


c 


678 


48.2 


2 . 


1 


206924 


10 


AC087780 


AC087780 Mus muscu 




679 


48 


2. 


1 


1689 


8 


AY0B7431 


AY087431 Arabidops 




680 


48 


2. 


1 


4354 


5 


PPL249082 


AJ249082 Pleuronec 


c 


681 


48 


2. 


1 


130037 


8 


AP004192 


AP004192 Oryza sat 


c 


682. 


48 


2. 


1 


139342 


14 


AF325155 


AF325155 Spodopter 


c 


683 


48 


2. 


1 


160215 


10 


AC092531 


AC092531 Rattus no 




684 


48 


2. 


1 


164263 


8 


AP005515 


AP005515 Oryza sat 




685 


48 


2. 


1 


185139 


8 


AP004348 


AP004348 Oryza sat 


c 


686 


48 


2. 


1 


251485 


2 


AC107287 


AC107287 Rattus no 


c 


687 


48 


2. 


1 


256354 


10 


AC120922 


AC120922 Rattus no 





688 


47 . 


8 


2 . 


1 


1138 


6 


AX432482 




689 


47 . 


8 


2 . 


1 


1785 


8 


AK105261 




690 


47 . 


8 


2 . 


1 


1896 


8 


AK101659 




691 


47. 


8 


2 . 


1 


1896 


8 


AK103824 




692 


47. 


8 


2 . 


1 


110000 


1 


AE017333_08 




693 


47 . 


8 


2. 


1 


110000 


1 


CP000002 08 




694 


47. 


8 


2. 


1 


146154 


2 


AP004400 


c 


695 


47. 


8 


2. 


1 


183095 


8 


AP005186 




696 


47. 


6 


2. 


1 


1925 


14 


SLNPVEGT 




697 


47. 


4 


2. 


0 


1410 


6 


AX653349 




698 


47. 


4 


2. 


0 


1729 


8 


AK060965 




699 


47. 


4 


2. 


0 


135323 


8 


CNS08CBQ 




700 


47. 


2 


2. 


0 


1609 


8 


AB070757 


c 


701 


47 


2. 


0 


137678 


8 


AP005183 


c 


702 


47 


2. 


0 


138653 


8 


AP005178 




703 


46. 


8 


2. 


0 


998 


8 


AJ619862 




704 


46. 


8 


2. 


0 


998 


8 


AJ619867 




705 


46. 


8 


2. 


0 


998 


8 


AJ619868 




706 


46. 


8 


2. 


0 


998 


8 


AJ619875 




707 


46. 


8 


2. 


0 


998 


8 


AJ619876 




708 


46. 


8 


2. 


0 


998 


8 


AJ619877 




709 


46. 


8 


2. 


0 


998 


8 


AJ619878 




710 


46. 


8 


2. 


0 


1000 


8 


AJ619869 




711 


46. 


8 


2. 


0 


1000 


8 


AJ619870 




712 


46. 


8 


2. 


0 


1000 


8 


AJ619871 




713 


46. 


8 


2. 


0 


1000 


8 


AJ619872 




714 


46. 


8 


2. 


0 


1000 


8 


AJ619873 




715 


46. 


8 


2. 


0 


1000 


8 


AJ619879 




716 


46. 


8 


2. 


0 


1000 


8 


AJ619880 




717 


46. 


8 


2. 


0 


1002 


8 


AJ619874 




718 


46. 


8 


2. 


0 


158482 


14 


AY126275 




719 


46. 


6 


2. 


0 


1494 


8 


AF028237 




720 


46. 


6 


2. 


0 


2531 


8 


ZMMC2BZ1 




721 


46. 


6 


2. 


0 


2919 


8 


ZMMCCBZl 




722 


46. 


6 


2. 


0 


4294 


8 


AY191009 




723 


46. 


6 


2. 


0 


5772 


8 


AF355378 




724 


46. 


6 


2. 


0 


41355 


3 


U42832 




725 


46. 


6 


2. 


0 


106186 


8 


AF448416 




726 


46. 


6 


2. 


0 


146949 


2 


AL360003 




727 


46. 


6 


2. 


0 


150771 


9 


AL392106 




728 


46. 


6 


2. 


0 


226001 


8 


AF391808 


c 


729 


46. 


6 


2. 


0 


236362 


2 


AC006724 




730 


46. 


4 


2. 


0 


470 


11 


BX322426 




731 


46. 


4 


2. 


0 


1440 


6 


AX211632 


c 


732 


46. 


4 


2. 


0 


1440 


6 


AX211634 




733 


46. 


4 


2. 


0 


1440 




.AX505385 




734 


46. 


4 . 


2. 


0 


1471 


8 


AY142676 




735 


46. 


4 


2. 


0 


1599 


8 


AK065987 




736 


46. 


4 


2. 


0 


1615 


8 


AY074339 




737 


46. 


4 


2. 


0 


1621 


8 


AY057646 




738 


46. 


4 


2. 


0 


1645 


8 


AK105912 




739 


46. 


4 


2. 


0 


1645 


8 


AK119544 


c 


740 


46. 


4 


2. 


0 


1657 


8 


AY099833 




741 


46. 


4 


2. 


0 


1683 


8 


AK106006 




742 


46. 


4 


2. 


0 


138141 


8 


AP005190 


c 


743 


46. 


4 


2. 


0 


168354 


8 


AP005185 


c 


744 


46. 


4 


2. 


0 


197419 


8 


ATCHRIV41 



AX432482 Sequence 
AK105261 Oryza sat 
AK101659 Oryza sat 
AK103824 Oryza sat 
Continuation (9 of 
Continuation {9 of 
AP004400 Oryza sat 
AP005186 Oryza sat 

X84701 Spodoptera 
AX653349 Sequence 
AK060965 Oryza sat 
AL928753 Oryza sat 
AB070757 Vigna ang 
AP005183 Oryza sat 
AP005178 Oryza sat 
AJ619862 Arabidops 
AJ619867 Arabidops 
AJ619868 Arabidops 
AJ619875 Arabidops 
AJ619876 Arabidops 
AJ619877 Arabidops 
AJ619878 Arabidops 
AJ619869 Arabidops 
AJ619870 Arabidops 
AJ619871 Arabidops 
AJ619872 Arabidops 
AJ619873 Arabidops 
AJ619879 Arabidops 
AJ619880 Arabidops 
AJ619874 Arabidops 

AY126275 Mamestra 
AF028237 Ipomoea p 
X13501 Maize (Bz-M 
X13500 Zea mays (B 
AY191009 Zea mays 
AF355378 Zea mays 
U42832 Caenorhabdi 
AF448416 Zea mays 
AL360003 Homo sapi 
AL392106 Human DNA 
AF391808 Zea mays 
AC006724 Caenorhab 

BX322426 Arabidops 
AX211632 Sequence 
AX211634 Sequence 
AX505385 Sequence 
AY142676 Arabidops 
AK065987 Oryza sat 
AY074339 Arabidops 
AY057646 Arabidops 
AK105912 Oryza sat 
AK119544 Oryza sat 
AY099833 Arabidops 
AK106006 Oryza sat 
AP005190 Oryza sat 
AP005185 Oryza sat 
AL161541 Arabidops 



c 


745 


46.4 


2 . 


0 


205065 


8 


ATFCA4 


Z97339 Arabidopsis 


c 


746 


46.4 


2. 


0 


250029- 


3 


AE014820 


AE014820 Plasmodiu 


c 


747 


46.4 


2. 


0 


287560 


1 


AE017274 


AE017274 Bacillus 




748 


46.2 


2. 


0 


467 


11 


BV008367 


BV008367 MASC_STS1 


c 


749 


46.2 


2 . 


0 


490 


6 


CQ431226 


CQ431226 Sequence 




750 


46.2 


2. 


0 


1458 


6 


AX654445 


AX654445 Sequence 




751 


46.2 


2. 


0 


1464 


6 


AX653704 


AX653704 Sequence 


c 


752 


46.2 


2. 


0 


1638 


8 


AK105478 


AK105478 Oryza sat 




753 


46.2 


2. 


0 


1663 


8 


AK062590 


AK062590 Oryza sat 




754 


46.2 


2. 


0 


1914 


8 


AK068586 


AK068586 Oryza sat 


c 


755 


46.2 


2. 


0 


134534 


2 


AP004327 


AP004327 Oryza sat 


c 


756 


46.2 


2. 


0 


140769 


8 


AP003508 


AP003508 Oryza sat 


c 


757 


46.2 


2. 


0 


141036 


2 


AP003941 


AP003941 Oryza sat 


c 


758 


46.2 


2. 


0 


150455 


8 


AP004571 


AP004571 Oryza sat 


c 


759 


46.2 


2. 


0 


162520 


8 


AP005913 


AP005913 Oryza sat 


c 


760 


46.2 


2. 


0 


165630 


8 


AP003617 


AP003617 Oryza sat 


c 


761 


46.2 


2. 


0 


166005 


8 


AC120506 


AC120506 Oryza sat 


c 


762 


46.2 


2. 


0 


194284 


5 


AL954329 


AL954329 Zebrafish 




763 


46 


2. 


0 


1259 


8 


MECGT6 


X77463 M.esculenta 




764 


46 


2. 


0 


1477 


6 


CQ595878 


CQ595878 Sequence 




765 


46 


2. 


0 


1553 


8 


AK060228 


AK060228 Oryza sat 




766 


46 


2. 


0 


1680 


8 


AK069049 


AK069049 Oryza sat 


c 


767 


46 


2. 


0 


1852 


8 


AK105674 


AK105674 Oryza sat 




768 


46 


2. 


0 


3528 


6 


CQ595877 


CQ595877 Sequence 


c 


769 


46 


2. 


0 


20493 


2 


AC014076 


AC014076 Drosophil 


c 


770 


46 


2. 


0 


138141 


8 


AP005190 


AP005190 Oryza sat 




771 


46 


2. 


0 


148780 


3 


AC008326 


AC008326 Drosophil 


c 


772 


46 


2. 


0 


156983 


8 


OSJN00212 


AL663014 Oryza sat 


c 


773 


46 


2. 


0 


164263 


8 


AP005515 


AP005515 Oryza sat 


c 


774 


46 


2. 


0 


168759 


8 


AC079887 


AC079887 Oryza sat 




775 


46 


2. 


0 


267997 


3 


AE003616 


AE003616 Drosophil 




776 


45.8 


2. 


0 


1395 


6 


AX654788 


AX654788 Sequence 




111 


45.8 


2. 


0 


1783 


8 


AK108674 


AK108674 Oryza sat 




HQ 


45.8 


2. 


0 


5908 


1 


AF147704 


AF147704 Streptomy 




779 


45.8 


2. 


0 


13511 


6 


AX345183 


AX345183 Sequence 




780 


45.8 


2. 


0 


18585 


6 


AX281498 


AX281498 Sequence 


c 


781 


45.8 


2. 


0 


138155 


2 


AC151564 


AC151564 Dasypus n 


c 


782 


45.8 


2. 


0 


191765 


8 


AC129717 


AC129717 Oryza sat 




783 


45.6 


2. 


0 


1510 


8 


AK121682 


AK121682 Oryza sat 




784 


45.6 


2. 


0 


1639 


8 


AK105398 


AK105398 Oryza sat 


c 


785 


45.6 


2. 


0 


1689 


8 


AK063307 


AK063307 Oryza sat 




786 


45.6 


2. 


0 


1750 


8 


AK119905 


AK119905 Oryza sat 




787 


45.6 


2. 


0 


32784 


3 


CEF08G5 


Z70682 Caenorhabdi 


c 


788 


45.6 


2. 


0 


97352 


8 


AC091670 


AC091670 Oryza sat 




789 


45.6 


2. 


0 


133889 


8 


AC133334 


AC133334 Oryza sat 




790 


45.4 


2. 


0 


1688 


8 


AK107291^..-. 


AK107291 Oryza sat 




791 


45.4 


2. 


0 


1706 


8 


AK072018 


AK072018 Oryza sat 




792 


45.4 


2. 


0 


105227 


8 


AP005296 


AP005296 Oryza sat 




793 


45.4 


2. 


0 


119926 


8 


AP003705 


AP003705 Oryza sat 




794 


45.4 


2. 


0 


135611 


14 


AF169823 


AF169823 Spodopter 




795 


45.4 


2. 


0 


137967 


2 


AC093094 


AC093094 Oryza sat 


c 


796 


45.4 


2. 


0 


165307 


2 


AC123749 


AC123749 Mus muscu 




797 


45.2 


1. 


9 


994 


8 


AJ619855 


AJ619855 Arabidops 




798 


45.2 


1. 


9 


994 


8 


AJ619856 


AJ619856 Arabidops 




799 


45.2 


1. 


9 


1383 


6 


AX653292 


AX653292 Sequence 




800 


45.2 


1. 


9 


1452 


6 


AX653929 


AX653929 Sequence 


c 


801 


45.2 


1. 


9 


1463 


8 


AK108695 


AK108695 Oryza sat 





802 


45.2 


1. 


9 


1549 


6 


AX660242 


AX660242 Sequence 




803 


45.2 


1. 


9 


1603 


8 


AK107156 


AK107156 Oryza sat 




804 


45.2 


1. 


9 


1736 


8 


BT009372 


BT009372 Triticum 




805 


45.2 


1. 


9 


1809 


8 


AK103303 


AK103303 Oryza sat 




806 


45.2 


1. 


9 


2000 


6 


AX656006 


AX656006 Sequence 


c 


807 


45.2 


1. 


9 


157987 


8 


AP,003560 


AP003560 Oryza sat 




808 


45.2 


1. 


9 


163095 


8 


AP002523 


AP002523 Oryza sat 




809 


45 


1. 


9 


45 


6 


AX697217 


AX697217 Sequence 




810 


45 


1. 


9 


955 


8 


AB012115 


7^012115 Vigna man 




811 


45 


1. 


9 


1521 


8 


AY167672 


AY167672 Zea mays 




812 


45 


1. 


9 


1521 


8 


AY167673 


AY167673 Zea mays 




813 


45 


1. 


9 


1521 


8 


AY167675 


AY167675 Zea mays 




814 


45 


1. 


9 


1521 


8 


AY167676 


AY167676 Zea mays 




815 


45 


1. 


9 


1521 


8 


AY167679 


AY167679 Zea mays 




816 


45 


1. 


9 


1536 


6 


E49073 


E49073 UDP-D-Gluco 




817 


45 


1. 


9 


1561 


8 


AB191249 


AB191249 Dianthus 




818 


45 


1. 


9 


1575 


8 


AB002818 


AB002818 Perilla f 




819 


45 


1. 


9 


1594 


8 


ZMW22BZ1 


X13502 Maize (Bz-W 




820 


45 


1. 


9 


1732 


6 


E49068 


E49068 UDP-D-Gluco 




821 


45 


1. 


9 


1732 


8 


AB033758 


AB033758 Citrus un 




822 


45 


1. 


9 


2908 


8 


ZMBZW22 


X07937 Maize Bz-W2 




823 


45 


1. 


9 


3181 


8 


ZMBZR 


X07941 Maize bronz 


c 


824 


45 


1. 


9 


37329 


3 


U97009 


U97009 Caenorhabdi 




825 


45 


1. 


9 


103960 


8 


ATAC011664 


ACG11664 Arabidops 




826 


45 


1. 


9 


147472 


8 


AP003974 


AP003974 Oryza sat 


c 


827 


45 


1. 


9 


149227 


2 


AC146976 


AC146976 Zea mays 


c 


828 


45 ^ 


1. 


9 


179810 


9 


AC139345 


AC139345 Papio anu 


c 


829 


45 


1. 


9 


288479 


2 


AC146814 


AC146814 Zea mays 


c 


830 


44.8 


1. 


9 


774 


6 


BD019848 


BD019848 Novel gen 


c 


831 


44.8 


1. 


9 


774 


6 


BD099786 


BD099786 Novel gen 




832 


44.8 


1. 


9 


858 


10 


RATUGT1B3G 


D38067 Rattus norv 




833 


44.8 


1. 


9 


995 


8 


AJ619882 


AJ619882 Arabidops 




834 


44.8 


1. 


9 


995 


8 


AJ619883 


AJ619883 Arabidops 




835 


44.8 


1. 


9 


1362 


6 


AX412617 


AX412617 Sequence 




836 


44.8 


1. 


9 


1362 


6 


AX507722 


AX507722 Sequence 




837 


44.8 


1. 


9 


1362 


6 


AX589823 


7\X589823 Sequence 




838 


44.8 


1. 


9 


1362 


6 


AX652007 


AX652007 Sequence 




839 


44.8 


1. 


9 


1521 


8 


AK110892 


73iK110892 Oryza sat 




840 


44.8 


1. 


9 


1787 


8 


AF303396 


AF303396 Phaseolus 


c 


841 


44.8 


1. 


9 


78089 


3 


AC024876 


AC024876 Caenorhab 


c 


842 


44.8 


1. 


9 


95824 


9 


AC068888 


AC068888 Homo sapi 


c 


843 


44.8 


1. 


9 


103223 


8 


AC007153 


AC007153 Arabidops 




844 


44.8 


1. 


9 


110000 


2 


AC109940 0 


AC109940 Rattus no 




845 


44.8 


1. 


9 


192969 


2 


AC150463 


AC150463 Callithri 




846 


44.8 


1. 


9 


221618 


10 


AC092530 


AC092530 Rattus no 




847 


44.8 


1. 


9 


224007 


2 


AC111764 


AC111764 Rattus no 


c 


848 


44.8 


1. 


9 


298804 


2 


AC006911 


AC006911 Caenorhab 




849 


44.8 


1. 


9 


299081 


2 


AC006892 


AC006892 Caenorhab 




850 


44.6 


1. 


9 


762 


6 


BD224494 


BD224494 Materials 




851 


44.6 


1. 


9 


762 


6 


AR216544 


AR216544 Sequence 


c 


852 


44.6 


1. 


9 


1141 


6 


AX083744 


AX083744 Sequence 




853 


44.6 


1. 


9 


1558 


8 


AB070748 


AB070748 Vigna ang 




854 


44.6 


1. 


9 


1662 


8 


AK064105 


AK064105 Oryza sat 




855 


44.6 


1. 


9 


2591 


8 


AK106639 


AK106639 Oryza sat 




856 


44.6 


1. 


9 


110000 


1 


AE017263 2 


Continuation (3 of 


c 


857 


44.6 


1. 


9 


159861 


8 


AP005012 


AP005012 Oryza sat 




858 


44.6 


1. 


9 


287476 


1 


AE017271 


AE017271 Bacillus 



oca 

859 


A A A 

44.4 


1 . 


9 


1175 


8 


AB074489 


AB074489 Malus x d 


o ^ r\ 

860 


A A A 

44.4 


1 . 


9 


54025 


8 


AP006525 


AP006525 Oryza sat 


ool 


A A 0 


1 . 


9 


1666 


8 


AK106038 


AK106038 Oryza sat 


862 


44 . 2 


1 . 


9 


133427 


2 


AC134521 


AC134521 Medicago 


863 


A A 0 

44 . 2 


1 . 


9 


144271 


8 


AC098682 


AC098682 Genomic s 


864 


44 . 2 


1 . 


9 


302524 


8 


AE017075 


AE017075 Oryza sat 


865 


44 


1 . 


9 


1895 


8 


AK108846 


AK108846 Oryza sat 


866 


44 


1 . 


9 


3986 


8 


AY490797 


AY490797 Helianthu 


867 


44 


1. 


9 


4556 


9 


AF153832S3 


AF153834 Homo sapi 


800 


A A 

44 


1 . 


9 


T >i yi 0 ft 0 

144392 


9 


HS67K17 


AL023584 Human DNA 


a /T f\ 

869 


44 


1 . 


9 


148762 


8 


AP002843 


AP002843 Oryza sat 


870 


44 


1 . 


9 


172752 


2 


AC116496 


AC116496 Mus muscu 


871 


44 


1 . 


9 


236164 


2 


AC113057 


AC113057 Mus muscu 


872 


43 . 8 


1 . 


9 


432 


6 


A63526 


A63526 Sequence 7 


873 


43.8 


1. 


9 


1152 


6 


AX653506 


AX653506 Sequence 


874 


43.8 


1. 


9 


1482 


6 


AX653177 


AX653177 Sequence 


875 


43 . 8 


1 . 


9 


1654 


8 


AK121725 


AK121725 Oryza sat 


876 


43. 8 


1. 


9 


1660 


8 


AB070745 


AB070745 Vigna ang 


Qll 


43.8 


1. 


9 


1726 


8 


AK064395 


AK064395 Oryza sat 


878 


43.8 


1 . 


9 


1798 


8 


AY663785 


AY663785 Fragaria 


879 


43.8 


1 . 


9 


1958 


8 


AY171598 


AY171598 Fragaria 


880 


43 . 8 


1 . 


9 


37845 


3 


CEF01D4 


Z81054 Caenorhabdi 


881 


43 . 8 


1. 


9 


46394 


3 


CBRG17D06 


AC084511 Caenorhab 


882 


43.8 


1. 


9 


134534 


2 


AP004327 


AP004327 Oryza sat 


883 


43. 8 


1. 


9 


141036 


2 


AP003941 


AP003941 Oryza sat 


884 


43.8 


1. 


9 


165630 


8 


AP003617 


AP003617 Oryza sat 


885 


43 . 8 


1 . 


9 


182870 


3 


AC116960 


AC116960 Dictyoste 


886 


43 . 8 


1 . 


9 


184645 


2 


AC108395 


AC108395 Mus muscu 


887 


43. 8 


1 . 


9 


349954 


6 


AX196297 


AX196297 Sequence 


888 


43 . 6 


1 . 


9 


617 


12 


AY199453 


AY199453 Arabidops 


889 


43. 6 


1. 


9 


993 


8 


AJ619881 


AJ619881 Arabidops 


890 


43. 6 


1. 


9 


993 


8 


AJ619884 


AJ619884 Arabidops 


891 


43 . 6 


1 . 


9 


993 


8 


AJ619885 


AJ619885 Arabidops 


892 


43. 6 


1. 


9 


994 


8 


AJ619857 


AJ619857 Arabidops 


893 


43 . 6 


1. 


9 


994 


8 


AJ619858 


AJ619858 Arabidops 


894 


43 . 6 


1 . 


9 


994 


8 


AJ619859 


AJ619859 Arabidops 


895 


43. 6 


1. 


9 


994 


8 


AJ619860 


. AJ619860 Arabidops 


896 


43 . 6 


1. 


9 


994 


8 


AJ619861 


AJ619861 Arabidops 


897 


43.6 


1. 


9 


994 


8 


AJ619863 


AJ619863 Arabidops 


898 


43 . 6 


1 . 


9 


994 


8 


AJ619864 


AJ619864 Arabidops 


899 


43 . 6 


1 . 


9 


994 


8 


AJ619865 


AJ619865 Arabidops 


900 


43 . 6 


1 . 


9 


994 


8 


AJ619866 


AJ6198 66 Arabidops 


901 


43 . 6 


1 . 


9 


1001 


6 


AR284363 


AR284363 Sequence 


902 


43 . 6 


1 . 


9 


1001 


6 


AR284364 


AR284364 Sequence 


903 


43 . 6 


. 1 . 


9 


1001 


6 


AR284365 


AR284365 Sequence 


904 


4.3.. 6 


1 . 


9 


1001 


6 


AR284367 


AR28.4367 Sequence 


905 


43 . 6 


1. 


9 


1021 


6 


BD229240 


BD229240 Genotype 


906 


43 . 6 


1 . 


9 


1021 


6 


AR349492 


AR349492 Sequence 


907 


43 . 6 


1 . 


9 


1073 


8 


MECGT2 


X77461 M.esculenta 


908 


43 . 6 


1 . 


9 


1350 


6 


AX412670 


AX412670 Sequence 


909 


43 . 6 


1. 


9 


1350 


6 


AX507107 


AX507107 Sequence 


910 


43 . 6 


1 . 


9 


1350 


6 


AX651538 


AX651538 Sequence 


Q1 1 
^ XL 


4o . 0 


1 . 




±OOU 






BT01032 / Arabidops 


912 


43.6 


1. 


9 


1437 


6 


CQ759478 


CQ759478 Sequence 


913 


43.6 


1. 


9 


1437 


6 


AX211615 


AX211615 Sequence 


914 


43.6 


1. 


9 


1527 


8 


AY087340 


AY087340 Arabidops 


915 


43.6 


1. 


9 


1565 


8 


AY087866 


AY087866 Arabidops 





916 


43.6 


1. 


9 


1582 


8 


AY062483 


c 


917 


43.6 


1. 


9 


98734 


8 


AC002333 




918 


43 . 6 


1. 


9 


103785 


8 


ATF1I16 




919 


43.6 


1. 


9 


110036 


8 


AC142095 


c 


920 


43.6 


1. 


9 


203301 


2 


AC147067 




921 


43 . 4 


1. 


9 


929 


8 


AK059765 




922 


43.4 


1. 


9 


1359 


6 


AX654630 




923 


43.4 


1. 


9 


1521 


8 


AY167674 




924 


43.4 


1. 


9 


1521 


8 


AY167677 




925 


43.4 


1. 


9 


1521 


8 


AY167678 




926 


43.4 


1. 


9 


1604 


8 


AK106312 




927 


43.4 


1. 


9 


1621 


8 


AK100189 




928 


43.4 


1. 


9 


4296 


8 


AK068604 




929 


43.4 


1. 


9 


8867 


8 


AY191010 




930 


43.4 


1. 


9 


114283 


2 


AC149822 




931 


43.4 


1. 


9 


148191 


8 


OSJN00067 




932 


43.4 


1. 


9 


149049 


8 


AP004340 


c 


933 


43.4 


1. 


9 


157452 


2 


AC087800 




934 


43.4 


1. 


9 


163032 


8 


AP004263 




935 


43.2 


1. 


9 


1001 


6 


AR284366 




936 


43.2 


1. 


9 


1546 


8 


AK068362 




937 


43.2 


1. 


9 


10329 


6 


AX347024 


c 


938 


43.2 


1. 


9 


43154 


3 


U55369 


c 


939 


43.2 


1. 


9 


103234 


2 


AP003997 


c 


940 


43.2 


1. 


9 


128955 


8 


AP003754 


c 


941 


43.2 


1. 


9 


141040 


8 


AP004382 


c 


942 


43.2 


1. 


9 


146951 


8 


AP003270 


c 


943 


43.2 


1. 


9 


157248 


8 


AP004300 




944 


43.2 


1. 


9 


226077 


2 


AC013296 




945 


43.2 


1. 


9 


229569 


2 


AC107424 


c 


946 


43.2 


1. 


9 


251131 


2 


AC111217 




947 


43.2 


1. 


9 


349980 


6 


AX344569 




948 


43.2 


1. 


9 


349980 


6 


AX344570 




949 


43.2 


1. 


9 


349980 


6 


AX344573 




950 


43.2 


1. 


9 


349980 


6 


AX344574 


c 


951 


43 


1. 


9 


340 


8 


AY027263 




952 


43 


1. 


9 


393 


6 


AX660985 


c 


953 


43 


1. 


9 


469 


11 


BV009002 


c 


954 


43 


1. 


9 


519 


11 


BV008995 




955 


43 


1. 


9 


1350 


6 


CQ759477 




956 


43 


1. 


9 


1440 


8 


AF190634 




957 


43 


1. 


9 


110000 


1 


AE017180 09 




958 


43 


1. 


9 


142376 


8 


AF503433 




959 


42. 8 


1. 


8 


1476 


6 


AX653505 




960 


42. 8 


1. 


8 


1597 


8 


AK068878 




961 


42.8 


1. 


8^ 


1637 


14 


AF313417 




962 


42.8 


1. 


8 


1819 


8 


AF117267 


c 


963 


42.8 


1. 


8 


35033 


3 


AF067615 


c 


964 


42.8 


1. 


8 


100999 


14 


AF270937 


c 


965 


42.8 


1. 


8 


124526 


8 


AC146585 




966 


42.8 


1. 


8 


131689 


9 


AC010588 




967 


42.8 


1. 


8 


161658 


9 


AC008505 


c 


968 


42.8 


1. 


8 


168753 


9 


AC006120 


c 


969 


42.8 


1. 


8 


276829 


2 


AC006741 




970 


42.6 


1. 


8 


432 


5 


PFL291986 




971 


42.6 


1. 


8 


1368 


6 


AX766275 




972 


42.6 


1. 


8 


1760 


8 


AK099049 



AY062483 Arabidops 
AC002333 Arabidops 
AL161667 Arabidops 
AC142095 Medicago 
AC147067 Homo sapi 
AK059765 Oryza sat 
AX654630 Sequence 
AY167674 Zea mays 
AY167677 Zea mays 
AY167678 Zea mays 
AK106312 Oryza sat 
AK100189 Oryza sat 
AK068604 Oryza sat 
AY191010 Zea mays 
AC149822 Zea mays 
AL606602 Oryza sat 
AP004340 Oryza sat 
AC087800 Mus muscu 
AP004263 Oryza sat 
AR284366 Sequence 
AK068362 Oryza sat 
AX347024 Sequence 
U55369 Caenorhabdi 
AP003997 Oryza sat 
AP003754 Oryza sat 
AP004382 Oryza sat 
AP003270 Oryza sat 
AP004300 Oryza sat 
AC013296 Homo sapi 
AC107424 Homo sapi 
AC111217 Rattus no 
AX344569 Sequence 
AX344570 Sequence 
AX344573 Sequence 
AX344574 Sequence 
AY027263 Arabidops 
AX660985 Sequence 

BV009002 MASC_STS1 

BV008995 MASC_STS1 
CQ759477 Sequence 
AF190634 Nicotiana 
Continuation (10 o 
AF503433 Sorghum b 
AX653505 Sequence 
AK068878 Oryza sat 

AF313417 Anticarsi 
AF117267 Malus dom 
AF067615 Caenorhab 

AF270937 Plutella 
AC146585 Medicago 
AC010588 Homo sapi 
AC008505 Homo sapi 
AC006120 Homo sapi 
AC006741 Caenorhab 
AJ291986 Platichth 
AX766275 Sequence 
AK099049 Oryza sat 



973 


42 


. 6 


1. 


8 


1761 


8 


AK066462 


AK066462 


Oryza sat 


974 


42 


.6 


1. 


8 


2025 


6 


AR152488 


AR152488 


Sequence 


975 


42 


.6 


1. 


8 


2025 


6 


BD005789 


BD005789 


Biologica 


976 


42 


.6 


1. 


8 


2025 


14 


HZU89958 


U89958 Helicoverpa 


911 


42 


.6 


1. 


8 


2505 


14 


AF503939 


AF503939 Helicover 


978 


42 


.6 


1. 


8 


27908 


3 


CET04H1 


Z78200 Caenorhabdi 


979 


42 


.6 


1. 


8 


31495 


14 


AF275264 


AF275264 Helicover 


c 980 


42 


.6 


1. 


8 


84203 


8 


AC005106 


AC005106 


Genomic s 


c 981 


42 


.6 


1. 


8 


87577 


2 


BX640459 


BX640459 


Danio rer 


982 


42 


.6 


1. 


8 


114078 


2 


BX640498 


BX640498 


Danio rer 


c 983 


42 


.6 


1. 


8 


121889 


5 


BX255952 


BX255952 


Zebrafish 


c 984 


42 


.6 


1. 


8 


122128 


8 


CNS07YPV 


AL731761 


Oryza sat 


985 


42 


.6 


1. 


8 


130759 


14 


AF303045 


AF303045 Helicover 


986 


42 


.6 


1. 


8 


130869 


14 


AF334030 


AF334030 Helicover 


987 


42 


.6 


1. 


8 


131403 


14 


AF271059 


AF271059 Heliocove 


c 988 


42 


.6 


1. 


8 


158158 


8 


AC146522 


AC146522 


Oryza sat 


989 


42 


.6 


1. 


8 


195190 


2 


AC110843 


AC110843 


Rattus no 


c 990 


42 


.6 


1. 


8 


246998 


2. 


CR450707 


CR450707 


Danio rer 


991 


42 


.6 


1. 


8 


250621 


2 


CR792433 


CR792433 


Danio rer 


992 


42 


.4 


1. 


8 


121 


6 


AX266625 


AX266625 


Sequence 


c 993 


42 


.4 


1. 


8 


121 


6 


AX266626 


AX266626 


Sequence 


994 


42 


.4 


1. 


8 


1410 


6 


AX211623 


AX211623 


Sequence 


c 995 


42 


.4 


1. 


8 


1410 


6 


AX211625 


AX211625 


Sequence 


996 


42 


.4 


1. 


8 


1410 


6 


AX720222 


AX720222 


Sequence 


997 


42 


.4 


1. 


8 


1410 


6 


AX766274 


AX766274 


Sequence 


998 


42 


.4 


1. 


8 


1410 


8 


AY078051 


AY078051 


Arabidops 


999 


42 


.4 


1. 


8 


1428 


6 


AX507686 


AX507686 


Sequence 


1000 


42 


.4 


1. 


8 


1428 


8 


BT012573 


BT012573 


Arabidops 


1001 


42 


.4 


1. 


8 


1430 


6 


AX211635 


AX211635 


Sequence 


C1002 


42 


.4 


1. 


8 


1430 


6 


AX211637 


AX21I637 


Sequence 


1003 


42 


.4 


1. 


8 


1437 


8 


AY663786 


AY663786 


Fragaria 


1004 


42 


.4 


1. 


8 


1560 


6 


AX412846 


AX412846 


Sequence 


1005 


42 


.4 


1. 


8 


1563 


6 


AX654346 


AX654346 


Sequence 


1006 


42 


.4 


1. 


8 


1592 


8 


AF196777 


AF196777 


Arabidops 


1007 


42 


.4 


1. 


8 


1597 


8 


AF367358 


AF367358 


Arabidops 


1008 


42 


.4 


1. 


8 


1705 


8 


AK099145 


AK099145 


Oryza sat 


1009 


42 


.4 


1. 


8 


1719 


8 


AK070110 


AK070110 


Oryza sat 


1010 


42 


.4 


1. 


8 


1776 


8 


AK059031 


AK059031 


Oryza sat 


1011 


42 


.4 


1. 


8 


2000 


6 


AX655393 


AX655393 


Sequence 


1012 


42 


.4 


1. 


8 


91841 


10 


AL606971 


AL606971 Mouse DNA 


cl013 


42 


.4 


1. 


8 


126497 


9 


AC019044 


AC019044 


Homo sapi 


C1014 


42 


.4 


1. 


8 


130779 


8 


OSJN00283 


AL731638 


Oryza sat 


C1015 


42 


.4 


1. 


8 


141883 


8 


OSJN00271 


AL731626 


Oryza sat 


C1016 


42 


.4 


1. 


8 


148191 


8 


OSJN00067 


AL606602 


Oryza sat 


C1017 


42 


.4 


1. 


8 


156649 


8 


AC144738 


AC144738 


Oryza sat 


C1018 


42 


.4 


1. 


8 


157648 


Z*.^..BX901886 


BX901886 


Danio rer 


C1019 


42 


.4 


1. 


8 


164896 


2 


CR762432 


CR762432 


Danio rer 


1020 


42 


.4 


1. 


8 


168759 


8 


AC079887 


AC079887 


Oryza sat 


1021 


42 


.4 


1. 


8 


180557 


9 


AC007250 


AC007250 


Homo sapi 


C1022 


42 


.4 


1. 


8 


197061 


10 


AL606914 


AL606914 Mouse DNA 


1023 


42 


.4 


1. 


8 


301450 


1 


AP003185 


AP003185 


Clostridi 


C1024 


42 


.2 


1. 


8 


708 


5 


AF352753 


AF352753 


Oncorhync 


1025 


42 


.2 


1. 


8 


1734 


8 


AK060513 


AK060513 


Oryza sat 


1026 


42 


.2 


1. 


8 


2438 


8 


AF426026 


AF426026 


Piromyces 


1027 


42 


.2 


1. 


8 


11319 


1 


AE010792 


AE010792 


Methanosa 


1028 


42 


.2 


1. 


8 


40862 


6 


AX346975 


AX346975 


Sequence 


1029 


42 


.2 


1. 


8 


70746 


2 


AC150844 


AC150844 


Medicago 



cl030 


42 . 2 


1 . 


8 


75840 


8 


AF527807 


AF527807 Sorghum b 


1031' 


42.2 


1 . 


8 


117124 


8 


AP006360 


AP006360 Lotus cor 


cl032 


42.2 


1 . 


8 


165501 


2 


AC147569 


AC147569 Homo sapi 


1033 


42.2 


1 . 


8 


191821 


9 


AC022616 


AC022616 Homo sapi 


1034 


42 


1 . 


8 


408 


6 


AR496501 


AR496501 Sequence 


1035 


42 


1 . 


8 


408 


6 


AR511783 


AR511783 Sequence 


1036 


42 


1 . 


8 


1521 


14 


AY048771 


AY048771 Borabyx mo 


1037 


42 


1 . 


8 


1800 


6 


AR362648 


AR362648 Sequence 


1038 


42 


1. 


8 


2793 


6 


AR027919 


AR027919 Sequence 


1039 


42 


1. 


8 


2793 


6 


163351 


163351 Sequence 5 


1040 


42 


1. 


8 


2793 


14 


NPHEGT 


M22619 Autographa 


1041 


42 


1. 


8 


14613 


14 


NPHCUECDYS 


M96361 Autographa 


cl042 


42 


1. 


8 


40324 


6 


AX458634 


AX458634 Sequence 


1043 


42 


1. 


8 


50000 


6 


BD187788 


BD187788 A virus i 


cl044 


42 


1. 


8 


67884 


2 


AC115103 


AC115103 Homo sapi 


1045 


42 


1. 


8 


.117951 


9 


AC023049 


AC023049 Homo sapi 


cl046 


42 


1 . 


8 


119217 


14 


AF499596 


AF499596 Phthorima 


1047 


42 


1. 


8 


128413 


14 


NPHT3COMP 


L33180 Bombyx mori 


1048 


42 


1. 


8 


133894 


6 


A48542 


A48542 Sequence 1 


1049 


42 


1. 


8 


133894 


14 


L22858 


L22858 Autographa 


1050 


42 


■ 1. 


8 


149955 


14 


AY077832 


AY077832 Sheeppox 


1051 


42 


1. 


8 


150158 


8 


AC136905 


AC136905 Oryza sat 


1052 


42 


1. 


8 


154298 


8 


AP003504 


AP003504 Oryza sat 


1053 


42 


1. 


8 


156514 


8 


AP003514 


AP003514 Oryza sat 


1054 


42 


1. 


8 


174183 


2 


AC149689 


AC149689 Bos tauru 


1055 


42 


1. 


8 


174500 


8 


AP004738 


AP004738 Oryza sat 


1056 


42 


1. 


8 


182726 


2 


AC084436 


AC084436 Homo sapi 


1057 


42 


1. 


8 


185592 


10 


AL773522 


AL773522 Mouse DNA 


i058 


42 


1. 


8 


320135 


2 


AC151850 


AC151850 Takifugu 


C1059 


41.8 


1. 


8 


424 


6 


AR495496 


AR495496 Sequence 


cl060 


41.8 


1. 


8 


424 


6 


AR510778 


AR510778 Sequence 


1061 


41.8 


1. 


8 


1218 


6 


CQ801246 


CQ801246 Sequence 


1062 


41.8 


1. 


8 


1457 


8 


AF127218 


AF127218 Forsythia 


1063 


41.8 


1. 


8 


1470 


6 


CQ804574 


CQ804574 Sequence 


1064 


41.8 


1. 


8 


1470 


8 


BT008765 


BT008765 Arabidops 


1065 


41.8 


1. 


8 


1605 


8 


AY081339 


AY081339 Arabidops 


1066 


41.8 


1. 


8 


2043 


8 


AB013598 


AB013598 Verbena h 


1067 


41.8 


1. 


8 


2326 


8 


HVBRNZIH 


X15694 Barley bron 


1068 


41.8 


1. 


8 


6174 


6 


CQ801273 


CQ801273 Sequence 


1069 


41.8 


1. 


8 


10325 


6 


CQ801245 


CQ801245 Sequence 


1070 


41.8 


1. 


8 


12134 


6 


CQ801279 


CQ801279 Sequence 


C1071 


41.8 


1. 


8 


50436 


9 


AL391258 


AL391258 Human DNA 


1072 


41.8 


1. 


8 


57121 


2 


AC084255 


AC084255 Homo sapi 


1073 


41.8 


1. 


8 


82139 


3 


AC115684 


AC115684 Dictyoste 


cl074 


41.8 


1. 


8 


97681 


9 


AL138931 


AL138931 Human DNA 


1075 


41.8 


1. 


8 


107200 


8 


AC006551^-. 


AC006551 Arabidops 


1076 


41.8 


1. 


8 


109873 


8 


AP003872 


AP003872 Oryza sat 


cl077 


41.8 


1. 


8 


115248 


5 


BX255908 


BX255908 Zebrafish 


cl078 


41.8 


1. 


8 


131158 


14 


AY327402 


AY327402 Choriston 


1079 


41.8 


1. 


8 


148544 


8 


AP004636 


AP004636 Oryza sat 


cl080 


41.8 


1. 


8 


177320 


2 


AL356777 


AL356777 Homo sapi 


cl081 


41.8 


1. 


8 


192867 


9 


AL358815 


AL358815 Human DNA 


1082 


4 1 . o 


1 . 


8 


194693 


2 


CR753837 


CR753837 Danio rer 


C1083 


41.8 


1. 


8 


233688 


2 


BX571949 


BX571949 Danio rer 


1084 


41.6 


1. 


8 


466 


6 


AR495991 


AR495991 Sequence 


1085 


41.6 


1. 


8 


466 


6 


AR511273 


AR511273 Sequence 


1086 


41.6 


1. 


8 


700 


8 


AY201033 


AY201033 Arabidops 



1087 


41 . 


6 


1 . 


8 


1323 


6 


AX653753 


1088 


41. 


6 


1 . 


8 


1480 


8 


AK118988 


1089 


41. 


6 


1 . 


8 


7238 


6 


AX345350 


1090 


41. 


6 


1. 


8 


103517 


8 


AC073395 


1091 


41. 


6 


1. 


8 


117296 


8 


AC008153 


1092 


41. 


6 


1. 


8 


125283 


8 


AC144345 


C1093 


41. 


6 . 


1. 


8 


125623 


3 


AC115599 


1094 


41. 


6 


1. 


8 


167628 


9 


AC092570 


cl095 


41. 


6 


1. 


8 


197679 


10 


AC087801 


1096 


41. 


6 


1. 


8 


207945 


10 


AC123945 


1097 


41. 


6 


1. 


8 


211456 


10 


AL928926 


cl098 


41. 


6 


1. 


8 


219293 


2 


AC141471 


1099 


41. 


4 


1. 


8 


281 


8 


NTA538414 


1100 


41. 


4 


1. 


8 


468 


11 


BV008994 


1101 


41. 


4 


1. 


8 


525 


11 


BV009008 


1102 


41. 


4 


1. 


8 


1381 


8 


BT005494 


1103 


41. 


4 


1. 


8 


1434 


6 


AX211619 


1104 


41. 


4 


1. 


8 


1507 


8 


AY084687 


1105 


41. 


4 


1. 


8 


1584 


8 


AK071127 


1106 


41. 


4 


1. 


8 


1618 


8 


BT004159 


1107 


41. 


4 


1. 


8 


37329 


3 


U97009 


1108 


41. 


4 


1. 


8 


38951 


3 


CEAC3 


1109 


41. 


4 


1. 


8 


45701 


9 


AC099412 


1110 


41. 


4 


1. 


8 


45868 


8 


AC145456 


cllll 


41. 


4 


1. 


8 


75125 


8 


AB025604 


1112 


41. 


4 


1. 


8 


138870 


8 


AP005659 


1113 


41. 


4 


1. 


8 


145988 


2 


AC024059 


clll4 


41. 


4 


1. 


8 


161586 


9 


AC090179 


clllS 


41. 


4 


1. 


8 


181991 


2 


AC068322 


1116 


41. 


4 


1. 


8 


195806 


9 


AC027243 


C1117 


41. 


4- 


1. 


8 


215251 


2 


AC105505 


1118 


41. 


4 


1. 


8 


236880 


2 


AC125831 


1119 


41. 


4 


1. 


8 


248489 


2 


AC095253 


1120 


41. 


2 


1. 


8 


1374 


6 


CQ805414 


1121 


41. 


2 


1. 


8 


1374 


6 


AX506881 


1122 


41. 


2 


1. 


8 


1374 


6 


AX651852 


1123 


41. 


2 


1. 


8 


1405 


8 


AY117218 


1124 


41. 


2 


1. 


8 


1617 


8 


AY056277 


1125 


41. 


2 


1. 


8 


1732 


8 


AK099055 


1126 


41. 


2 


1. 


8 


1785 


8 


AK071953 


1127 


41. 


2 


1. 


8 


4686 


6 


AX654735 


1128 


41. 


2 


1. 


8 


7829 


6 


AX251889 


1129 


41. 


2 


1. 


8 


7829 


6 


AX344283 


1130 


41. 


2 


1. 


8 


7829 


6 


AX346007 


1131 


41. 


2 


1. 


8 


7829 


6 


AX348698 


... 1132 


41. 


2 


1. 


8 


10710 


6 


AX345795 


cll33 


41. 


2 


1. 


8 


14284 


14 


AF439352 


1134 


41. 


2 


1. 


8 


24768 


3 


CET07C5 


C1135 


41. 


2 


1. 


8 


100665 


8 


AC006533 


cll36 


41. 


2 


1. 


8 


112065 


8 


AC109596 


cll37 


41. 


2 


1. 


8 


124131 


2 


AC016335 


1138 


41. 


2 


1. 


8 


128755 


9 


AC013587 


1139 


41- 


2 


1. 


8 


129564 


9 


AC087302 


C1140 


41. 


2 


1. 


8 


133352 


8 


AC108499 


C1141 


41. 


2 


1. 


8 


138717 


10 


AL929416 


1142 


41. 


2 


1. 


8 


159891 


8 


AP004864 


1143 


41. 


2 


1. 


8 


161624 


2 


AP001493 



AX653753 Sequence 
AK118988 Arabidops 
AX345350 Sequence 
AC073395 Arabidops 
AC008153 Arabidops 
AC144345 Medicago 
AC115599 Dictyoste 
AC092570 Homo sapi 
AC087801 Mus muscu 
AC123945 Mus muscu 
7\L928926 Mouse DNA 
AC141471 Mus muscu 
AJ538414 Nicotiana 
BV008994 MASC_STS1 
BV009008 MASC_STS1 
BT005494 Arabidops 
AX211619 Sequence 
AY084687 Arabidops 
AK071127 Oryza sat 
BT004159 Arabidops 
U97009 Caenorhabdi 
Z71177 Caenorhabdi 
AC099412 Homo sapi 
AC145456 Cicer ari 
AB025604 Arabidops 
AP005659 Oryza sat 
AC024059 Homo sapi 
AC090179 Homo sapi 
AC068322 Homo sapi 
AC027243 Homo sapi 
AC105505 Rattus no 
AC125831 Rattus no 
AC095253 Rattus no 
CQ805414 Sequence 
AX506881 Sequence 
AX651852 Sequence • 
AY117218 Arabidops 
AY056277 Arabidops 
7VK099055 Oryza sat 
AK071953 Oryza sat 
AX654735 Sequence 
AX251889 Sequence 
AX344283 Sequence 
AX346007 Sequence 
AX348698 Sequence 
AX345795 Sequence 

AF439352 Choriston 
Z50006 Caenorhabdi 
AC006533 Arabidops 
AC109596 Oryza sat 
AC016335 Homo sapi 
AC013587 Homo sapi 
AC087302 Homo sapi 
AC108499 Oryza sat 

AL929416 Mouse DNA 
AP004864 Oryza sat 
AP001493 Homo sapi 



cll44 


41.2 


1 . 


8 


167580 


2 


AC146124 


cll45 


41.2 


1 . 


8 


167932 


2 


AC068133 


C1146 


41.2 


1 . 


8 


170270 


2 


AP001548 


1147 


41.2 


1 . 


8 


170413 


2 


AC090405 


cll48 


41.2 


1 . 


8 


172655 


2 


AC104203 


cll49 


41 . 2 


1. 


8 


174463 


5 


BX511007 


1150 


41.2 


1. 


8 


174645 


9 


AC091111 


1151 


41.2 


1. 


8 


175850 


2 


AP001399 


1152 


41.2 


1 . 


8 


180915 


8 


GTAJ10592 


1153 


41.2 


1 . 


8 


186150 


2 


AP001544 


1154 


41.2 


1 . 


8 


188800 


2 


AC015846 


cll55 


41.2 


1 . 


8 


194487 


2 


AP002896 


cll56 


41.2 


1 . 


8 


202699 


2 


BS000628 


cll57 


41.2 


1. 


8 


204120 


8 


AY661659 


1158 


41.2 


1. 


8 


204340 


9 


AC091103 


1159 


41.2 


1. 


8 


209710 


9 


CNSOIDXF 


1160 


41.2 


1. 


8 


230714 


9 


AC147345 


1161 


41.2 


1. 


8 


271793 


2 


BX572620 


cll62 


41.2 


1. 


8 


291421 


2 


AC132939 


1163 


41 


1. 


8 


1423 


8 


AY519364 


1164 


41 


1 . 


8 


1479 


6 


AX653930 


1165 


41 


1. 


8 


2000 


6 


AX656007 


1166 . 


41 


1. 


8 


2111 


14 


AF000009 


cll67 


41 


1. 


8 


110000 


2 


CR388160_1 


C1168 


41 


1. 


8 


135507 


9 


AL590733 


1169 


41 


1. 


8 


146885 


2 


BX957322 


1170 


41 


1. 


8 


159419 


5 


AC144823 


cll71 


41 


1. 


8 


162134 


2 


AC022478 


1172 


41 


1. 


8 


172614 


9 


AL929302 


1173 


41 


1. 


8 


173411 


9 


AC092754 


1174 


41 


1. 


8 


176202 


9 


CNS05TF3 


1175 


41 


1. 


8 


187429 


9 


AC092755 


cll76 


41 


1. 


8 


208770 


2 


CR388155 


1177 


41 


1. 


8 


216567 


2 


BX927074 


1178 


41 


1. 


8 


229637 


2 


CR556716 


1179 


40.8 


1. 


8 


1361 


8 


AB070749 


1180 


40.8 


1 . 


8 


1398 


6 


AX653340 


1181 


40.8 


1 . 


8 


1415 


8 


MECGTl 


C1182 


40.8 


1. 


8 


1581 


8 


AK120020 


1183 


40.8 


1. 


8 


1586 


8 


AK064270 


1184 


40.8 


1. 


8 


1656 


8 


AK068040 


C1185 


40.8 


1. 


8 


1706 


8 


AK108055 


1186 


40.8 


1. 


8 


1962 


8 


AK063325 


cll87 


40.8 


1. 


8 


7554 


3 


AY160096 


cll88 


40.8 


1. 


8 


16841 


9 


AB044136 


1189 


4.0 . 8 


1. 


8 


36846 


2 


AC149390 


cll90 


40.8 


1. 


8 


59940 


2 


AC024414 


1191 


40.8 


1. 


8 


70204 


9 


AC010737 


1192 


40.8 


1. 


8 


77204 


8 


AB046438 


cll93 


40.8 


1. 


8 


84705 


3 


CEY48E1B 


1194 


40.8 


1, 


8 


97352 


8 


AC091670 


1195 


40.8 


1. 


8 


102230 


8 


AP004045 


C1196 


40.8 


1. 


8 


121501 


8 


AC069557 


1197 


40.8 


1. 


8 


132558 


3 


AC025716 


C1198 


40.8 


1. 


8 


133476 


9 


AC008804 


C1199 


40.8 


1. 


8 


133889 


8 


AC133334 


C1200 


40.8 


1. 


8 


149972 


2 


AC101817 



AC146124 Pan trogl 
AC068133 Homo sapi 
AP001548 Homo sapi 
AC090405 Homo sapi 
AC104203 Mus muscu 
BX511007 Zebrafish 
AC091111 Homo sapi 
AP001399 Homo sapi 
AJ010592 Guillardi 
AP001544 Homo sapi 
AC015846 Homo sapi 
AP002896 Homo sapi 
BS000628 Pan trogl 
AY661659 Sorghum b 
AC091103 Homo sapi 
AL139296 Hximan chr 
AC147345 Pan trogl 
BX572620 Danio rer 
AC132939 Mus muscu 
AY519364 Citrus si 
AX653930 Sequence 
AX656007 Sequence 

AF000009 Heliothis 
Continuation (2 of 
AL590733 Human DNA 
BX957322 Danio rer 
AC144823 Danio rer 
AC022478 Homo sapi 
AL929302 Human DNA 
AC092754 Homo sapi 
AL359951 Human chr 
AC092755 Homo sapi 
CR388155 Danio rer 
BX927074 Danio rer 
CR556716 Danio rer 
AB070749 Vigna ang 
AX653340 Sequence 
X77459 M.esculenta 
AK120020 Oryza sat 
AK064270 Oryza sat 
AK068040 Oryza sat 
AK108055 Oryza sat 
AK063325 Oryza sat 
AY160096 Dictyoste 
AB044136 Homo sapi 
ACL49390 Phakopsor 
AC024414 Homo sapi 
AC010737 Homo sapi 
AB04 6438 Arabidops 
Z93393 Caenorhabdi 
AC091670 Oryza sat 
AP004045 Oryza sat 
AC069557 Genomic S 
AC025716 Caenorhab 
AC008804 Homo sapi 
AC133334 Oryza sat 
AC101817 Mus muscu 



1201 


40 . 


8 


1 


. 8 


153297 


2 


AC027558 


AC027558 Homo sapi 


1202 


40 . 


8 


1 


. 8 


178376 


8 


AP005008 


' AP005008 Oryza sat 


cl203 


40 . 


8 


1 


. 8 


190666 


2 


AY245865 


AY245865 Homo sapi 


1204 


40. 


8 


1 


. 8 


198845 


5 


BX510333 


BX510333 Zebrafish 


1205 


40 . 


8 


1 


. 8 


O "5 C ft c 

203595 


2 


CR385054 


CR385054 Danio rer 


1206 


40 . 


8 


1 


. 8 


247275 


10 


AC100736 


AC100736 Mus muscu 


cl207 


40 . 


O 


1 


. 8 


274525 


2 


AC006903 


AC006903 Caenorhab 


1208 


40 . 


8 


1 


. O 


340801 


2 


AC006751 


AC006751 Caenorhab 


cl209 


40 . 


8 


1 


. 8 


348174 


3 


CR382399 


CR382399 Plasmodia 


cl210 


40 . 


8 


1 


. 8 


349960 


6 


AX573241 


AX573241 Sequence 


1211 


40. 


6 


1 


. 8 


1768 


8 


AB098614 


AB098614 Glycyrrhi 


1212 


40. 


6 


1 


. 8 


6325 


3 


DDI012088 


AJ012088 Dictyoste 


1213 


40. 


6 


1 


. 8 


38206 


9 


AL356984 


AL356984 Human DNA 


cl214 


40. 


6 


1 


. 8 


110000 


2 


PFMAL13_04 


Continuation (5 of 


cl215 


40. 


6 


1 


. 8 


143291 


9 


HS163G9 


AL008733 Human DNA 


cl216 


40. 


6 


1 


. 8 


173223 


9 


AP003777 


AP003777 Homo sapi 


1217 


40 . 


6 


1 


. 8 


214480 


2 


AC113517 


AC113517 Mus muscu 


1218 


A f\ 

40 . 


6 


1 


. 8 


222186 


2 


AC120362 


AC120362 Mus muscu 


1219 


40. 


6 


1 


. 8 


257258 


2 


AC105322 


AC105322 Mus muscu 


1220 


40. 


6 


1 


. 8 


293431 


2 


PFMAL13P4 


AL049181 Plasmodiu 


1221 


40. 


6 


1 


. 8 


349980 


6 


AX344551 


AX344551 Sequence 


1222 


40. 


4 


1 


. 7 


404 


3 


MAU42338 


U42338 Meloidogyne 


1223 


40. 


4 


1 


.7 


1293 


8 


BT011794 


BT011794 Arabidops 


1224 


40. 


4 


1 


.7 


1437 


6 


AX652925 


AX652925 Sequence 


1225 


.40 . 


4 


1 


. 7 


1440 


6 


AX766278 


AX766278 Sequence 


1226 


40. 


4 


1 


. 7 


1514 


8 


AK110876 


AK110876 Oryza sat 


1227 


40. 


4 


1 


.7 


1537 


8 


CAR400861 


AJ400861 Cicer ari 


1228 


40 . 


4 


1 


. 7 


1567 


8 


AY625694 


AY625694 Oryza sat 


1229 


40 . 


4 


1 


. 7 


1664 


8 


AK061830 


AK061830 Oryza sat 


1230 


40. 


4 


1 


. 7 


1686 


8 


AK102335 


AK102335 Oryza sat 


1231 


40. 


4 


1 


.7 


1725 


8 


AK064351 


AK064351 Oryza sat 


1232 


40. 


4 


1 


.7 


1779 


8 


AK102415 


AK102415 Oryza sat 


cl233 


40. 


4 


1 


. 7 


59793 


8 


AB025634 


AB025634 Arabidops 


1234 


40. 


4 


1 


.7 


110000 


2 


PFMAL8P1_11 


Continuation (12 o 


1235 


40. 


4 


1 


. 7 


132254 


3 


AC116330 


AC116330 Dictyoste 


1236 


40. 


4 


1 


. 7 


157987 


8 


AP003560 


AP003560 Oryza sat 


1237 


40. 


4 


1 


. 7 


211799 


2 


AC138229 


AC138229 Mus muscu 


1238 


40. 


4 


1 


. 7 


226627 


2 


AC134181 


AC134181 Rattus no 


cl239 


40. 


4 


1 


. 7 


230926 


2 


AC114457 . 


AC114457 Rattus no 


1240 


40. 


4 


1 


. 7 


348034 


3 


CR382400 


CR382400 Plasmodiu 


cl241 


40. 


2 


1 


. 7 


366 


1 


AY430162 


AY430162 Helicobac 


cl242 


40 . 


2 


1 


. 7 


366 


1 


AY430163 


AY430163 Helicobac 


cl243 


40 . 


2 


1 


. 7 


366 


1 


AY430165 


AY430165 Helicobac 


cl244 


40. 


2 


1 


. 7 


805 


11 


BV026811 


BV026811 S212P6802 


1245 


40 . 


2 


1 


. 7 


863 


8 


HVU496572 


AJ496572 Hordeum v 


1246 


40. 


2 


1. 




1383 


6 


CQ806198 


CQ806198 Sequence 


1247 


40. 


2 


1 


. 7 


1399 


8 


AY128739 


AY128739 Arabidops 


1248 


40 . 


2 


1 


. 7 


1402 


8 


AY133752 


AY133752 Arabidops 


1249 


40 . 


2 


1 


. 7 


1418 


8 


AY114654 


AY114654 Arabidops 


1250 


40 . 


2 


1 


. 7 


1426 


8 


AY072325 


AY072325 Arabidops 


1251 


40 . 


2 


1 


. 7 


1492 


8 


AY062589 


AY062589 Arabidops 


1252 


40 . 


2 


1 


. 7 


1565 


8 


AY074526 


AY074526 Arabidops 




4 u . 


£. 


1 


. / 


±X>H. u 






AKiUo /oo Oryza sat 


1254 


40. 


2 


1 


.7 


1665 


8 


AK105966 


AK105966 Oryza sat 


1255 


40. 


2 


1 


.7 


1673 


8 


AK119530 


AK119530 Oryza sat 


1256 


40. 


2 


1 


.7 


36075 


3 


AF025468 


AF025468 Caenorhab 


C1257 


40. 


2 


1 


.7 


96123 


9 


AC114311 


AC114311 Homo sapi 



cl258 


40.2 


1 . 


7 


98101 


8 


ATF2K13 


AL391141 


Arabidops 


1259 


40 . 2 


1 . 


7 


104305 


8 


AC140022 


AC140022 Medicago 


1260 


40 . 2 


1 . 


7 


107014 


8 


OSJN00218 


AL663021 


Oryza sat 


cl261 


40 . 2 


1 . 


7 


107014 


8 


OSJN00218 


AL663021 


Oryza sat 


1262 


40.2 


1 . 


7 


108530 


5 


BX247884 


BX247884 


Zebraf ish 


1263 


40.2 


1 . 


7 


109918 


9 


AC114279 


AC114279 


Homo sapi 


cl264 


40.2 


1 . 


7 


110000 


2 


AC115142_0 


AC115142 


Rattus no 


cl265 


40.2 


1. 


7 


123500 


14 


U53466 


U53466 Cydia pomon 


cl266 


40.2 


1. 


7 


129563 


8 


OSJN00008 


AL606443 


Oryza sat 


cl267 


40.2 


1. 


7 


140691 


9 


AF159056 


AF159056 


Homo sapi 


1268 


40.2 


1. 


7 


143036 


2 


AC015498 


AC015498 


Homo sapi 


cl269 


40.2 


1. 


7 


143205 


8 


AC093018 


AC093018 


Oryza sat 


1270 


40.2 


1. 


7 


145427 


5 


BX276112 


BX276112 


Zebrafish 


1271 


40.2 


1 . 


7 


149662 


14 


AY077834 


AY077834 Sheeppox 


1272 


40.2 


1. 


7 


150057 


14 


AY077833 


AY077833 Sheeppox 


1273 


40.2 


1. 


7 


152009 


2 


AC010802 


AC010802 


Homo sapi 


cl274 


40.2 


1. 


7 


156814 


8 


AP005004 


AP005004 


Oryza sat 


1275 


40.2 


1. 


7 


162281 


9 


AC105395 


AC105395 


Homo sapi 


1276 


40.2 


1. 


7 


171816 


9 


AC006033 


AC006033 


Homo sapi 


1277 


40.2 


1. 


7 


171926 


2 


AC144073 


AC144073 


Macaca mu 


1278 


40.2 


1 . 


7 


173218 


2 


AP004686 


AP004686 


Oryza sat 


1279 


40.2 


1. 


7 


173219 


2 


BX897738 


BX897738 


Danio rer 


cl280 


40.2 


1. 


7 


189631 


2 


AC009920 


AC009920 


Homo sapi 


C1281 


40.2 


1. 


7 


191935 


2 


AC114747 


AC114747 


Homo sapi 


cl282 


40.2 


1. 


7 


198944 


8 


ATCHRIV38 


AL161538 


Arabidops 


cl283 


40.2 


1. 


7 


199138 


2 


CR450798 


CR450798 


Danio rer 


C1284 


40.2 


1. 


7 


200576 


8 


ATFCAO 


Z 97 3 35 Arabidopsis 


1285 


40.2 


1. 


7 


203114 


2 


AC011818 


AC011818 


Homo sapi 


cl286 


40.2 


1. 


7 


229726 


2 


AC109699 


AC109699 


Rattus no 


1287 


40.2 


1. 


7 


235614 


2 


CR450721 


CR450721 


Danio rer 


cl288 


40.2 


1. 


7 


280915 


2 


AC112104 


AC112104 


Rattus no 


1289 


40.2 


1. 


7 


349980 


6 


AX344566 


AX344566 


Sequence 


1290 


40 


1. 


7 


1219 


8 


AK068336 


AK068336 


Oryza sat 


1291 


40 


1. 


7 


1350 


6 


AX653931 


AX653931 


Sequence 


1292 


40 


1- 


7 


1377 


8 


BT002638 


BT002638 


Arabidops 


1293 


40 


1. 


7 


1411 


8 


BT000356 


BT000356 


Arabidops 


1294 


40 


1. 


7 


1457 


6 


AX211618 


AX211618 


Sequence 


1295 


40 


1. 


7 


1562 


8 


AY120731 


AY120731 


Arabidops 


1296 


40 


1. 


7 


1572 


8 


AY048297 


AY048297 


Arabidops 


1297 


40 


1. 


7 


1661 


8 


AK105967 


AK105967 


Oryza sat 


1298 


40 


1. 


7 


1686 


8 


AK060997 


AK060997 


Oryza sat 


1299 


40 


1. 


7 


1696 


8 


AK103242 


AK103242 


Oryza sat 


1300 


40 


1. 


7 


1842 


8 


AK106302 


AK106302 


Oryza sat 


C1301 


40 


1. 


7 


2026 


8 


AK117305 


AK117305 


Arabidops 


1302 


40 


1. 


7 


6029 


6 


AX346895 


AX346895 


Sequence 


1303 


40 


1. 


7 


7631.« 




AX345762 


AX345762 


Sequence 


1304 


40 


1. 


7 


8576 


6 


AX347130 


AX347130 


Sequence 


1305 


40 


1. 


7 


10716 


6 


AX346320 


AX346320 


Sequence 


cl306 


40 


1. 


7 


11985 


9 


BX322534 


BX322534 


Human DNA 


1307 


40 


1. 


7 


17389 


6 


AX346316 


AX346316 


Sequence 


1308 


40 


1. 


7 


27332 


3 


CER11A8 


Z70310 Caenorhabdi 


1309 


40 


1. 


7 


34548 


6 


AX349036 


AX349036 


Sequence 




A n 
4 U 


1 

1 . 


/ 




1 




AJ278573 


Streptomy 


1311 


40 


1. 


7 


94555 


9 


AL589684 


AL589684 


Human DNA 


1312 


40 


1. 


7 


97033 


2 


AC015146 


AC015146 


Drosophil 


C1313 


40 


1. 


7 


100815 


8 


ATF12A12 


AL133314 


TVrabidops 


1314 


40 


1. 


7 


104071 


2 


AL162260 


AL162260 


Homo sapi 



1315 


40 


1 . 


7 


110000 


2 


AC120698 0 


AC120698 Rattus no 


1316 


40 


1 . 


7 


110000 


2 


AC120698_1 


Continuation (2 of 


cl317 


40 


1 . 


7 


110000 


2 


BX276116_07 


Continuation (8 of 


cl318 


40 


1, 


7 


110000 


2 


BX276116_08 


Continuation (9 of 


1319 


40 


1 . 


7 


110000 


3 


AC116984_1 


Continuation (2 of 


1320 


40 


1. 


7 


143508 


9 


HSJ570L12 


AL049589 Human DNA 


1321 


40 


1 . 


7 


160366 


3 


AC069457 


AC069457 Drosophil 


1322 


40 


1. 


7 


160776 


2 


AC137960 


AC137960 Mus muscu 


1323 


40 


1. 


7 


168438 


2 


AC024632 


AC024632 Homo sapi 


1324 


40 


1. 


7 


168953 


9 


AL360219 


AL360219 Human DNA 


cl325 


40 


1. 


7 


178433 


9 


CNSOIRHH 


AL161752 Human chr 


1326 


40 


1. 


7 


180574 


9 


AC092807 


AC092807 Homo sapi 


1327 


40 


1. 


7 


181927 


3 


AC010573 


AC010573 Drosophil 


cl328 


40 


1. 


7 


182963 


2 


AC116815 


AC116815 Mus muscu 


cl329 


40 


1. 


7 


184470 


9 


AL589823 


AL589823 Human DNA 


C1330 


40 


1. 


7 


220897 


2 


AL954635 


AL954635 Homo sapi 


cl331 


40 


1. 


7 


226142 


2 


AC126422 


AC126422 Mus muscu 


cl332 


40 


1. 


7 


243348 


2 


AC150646 


AC150646 Bos tauru 


1333 


40 


1. 


7 


246640 


2 


AC108286 


AC108286 Rattus no 


cl334 


40 


1. 


7 


265352 


2 


AC095748 


AC095748 Rattus no 


1335 


40 


1 . 


7 


278708 


3 


AE003535 


AE003535 Drosophil 


cl336 


40 


1 . 


7 


300050 


1 


AP004171 


AP004171 Mycoplasm 


1337 


40 


1. 


7 


349980 


6 


AX344552 " 


AX344552 Sequence 


cl338 


39. 8 


1. 


7 


289 


• 6 


AR162089 


AR162089 Sequence 


cl339 


39 . 8 


1. 


7 


289 


6 


AR166614 


AR166614 Sequence 


1340 


39. 8 


1. 


7 


993 


8 


MECGT7 


X77464 M.esculenta 


1341 


39. 8 


1. 


7 


1371 


6 


AX412848 


AX412848 Sequence 


1342 


39. 8 


1. 


7 


1371 


6 


AX506822 


AX506822 Sequence 


1343 


39.8 


1. 


7 


1371 


6 


AX766276 


AX766276 Sequence 


1344 


39. 8 


1. 


7 


1371 


8 


BT000622 


BT000622 Arabidops 


1345 


39.8 


1. 


7 


1371 


8 


BT005368 


BT005368 Arabidops 


1346 


39. 8 


1. 


7 


1407 


6 


AX653241 


AX653241 Sequence 


1347 


39.8 


1. 


7 


1437 


6 


AX211638 


AX211638 Sequence 


cl348 


39.8 


1. 


7 


1437 


6 


AX211640 


AX211640 Sequence 


1349 


39. 8 


1. 


7 


1451 


6 


AX211616 


AX211616 Sequence 


1350 


39. 8 


1. 


7 


1494 


6 


AX211622 


AX211622 Sequence 


1351 


39. 8 


1. 


7 


1498 


8 


BT002579 


BT002579 Arabidops 


1352 


39. 8 


1. 


7 


1516 


8 


AK118431 


AK118431 Arabidops 


1353 


39. 8 


1. 


7 


1559 


8 


AY125506 . 


AY125506 Arabidops 


1354 


39.8 


1. 


7 


1687 


8 


AY062668 


AY062668 Arabidops 


cl355 


39. 8 


1. 


7 


1702 


3 


PFAHGPTA 


M88110 Plasmodium 


1356 


39. 8 


1. 


7 


2559 


6 


AX766273 


AX766273 Sequence 


1357 


39.8 


1. 


7 


2817 


14 


AF052502 


AF052502 Epiphyas 


cl358 


39. 8 


1. 


7 


11086 


1 


AE010596 


AE010596 Fusobacte 


1359 


39. 8 


1. 


7 


60193 


9 


7VL359705 


AL359705 Human DNA 


cl360 


39. 8 


1. 


7 


73184 


2 


AGO 9 08.64... 


AC090864 Homo sapi' 


cl361 


39. 8 


1 . 


7 


79991 


2 


AC108505 


AC108505 Oryza sat 


cl362 


39. 8 


1. 


7 


87637 


9 


AL390960 


AL390960 Human DNA 


cl363 


39. 8 


1. 


7 


104001 


8 


AC023628 


AC023628 Arabidops 


1364 


39. 8 


1. 


7 


107139 


2 


AL360223 


AL360223 Homo sapi 


1365 


39.8 


1. 


7 


108881 


8 


AC002391 


AC002391 Arabidops 


1366 


39. 8 


1. 


7 


109659 


2 


AC151499 


AC151499 Dasypus n 


cl367 


39.8 


1. 


7 


110000 


2 


AC116234 1 


Continuation (2 of 


1368 


39.8 


1. 


7 


118584 


14 


AY043265 


AY043265 Epiphyas 


1369 


39.8 


1. 


7 


124186 


9 


AL772392 


AL772392 Human DNA 


1370 


39.8 


1. 


7 


144572 


5 


BX322794 


BX322794 Zebrafish 


1371 


. 39.8 


1. 


7 


146782 


2 


AC127599 


AC127599 Rattus no 



cl372 


39 . 


o 
O 


1 


. 7 


153576 


2 


AC020572 




AC020572 


Homo sapi 


1373 


o9 . 


o 
o 


1 




162605 


2 


■jk<-»ft^'ftyftft 
AC068698- 




fk ^ ft /" ft ft ft 
AC068698 


Homo sapi 


cl374 


39 . 


8 


1 


. 7 


163088 


9 


AC097654 




AC097654 


Homo sapi 


1375 


39. 


8 


1 


. 7 


163364 


2 


AC116241 




AC116241 


Rattus no 


cl376 


39 . 


8 


1 


. 7 


180005 


2 


AC136070 




AC136070 


Rattus no 


cl377 


39 . 


8 


1 


. 7 


187651 


8 


AC129718 




AC129718 


Oryza sat 


cl378 


39 . 


8 


1 


. 7 


189139 


2 


AC122947 




AC122947 


Rattus no 


cl379 


39 . 


8 


1 


. 7 


198635 


2 


AC121182 




AC121182 


Rattus no 


1380 


39 . 


8 


1 


. 7 


ft ft -1 ii ft 
201427 


2 


AC127843 




AC127843 


Rattus no 


cl381 


39 . 


8 


1 


. 7 


213678 


2 


AC137173 




AC137173 


Rattus no 


cl382 


39. 


8 


1 


. 7 


215675 


2 


AC123264 




AC123264 


Rattus no 


cl383 


39 . 


8 


1 


. 7 


220283 


2 


AC103391 




AC103391 


Mus muscu 


cl384 


39 . 


8 


1 


. 7 


221631 


9 


AC010867 




AC010867 


Homo sapi 


cl385 


39. 


8 


1 


. 7 


224818 


2 


AC111839 




AC111839 


Rattus no 


cl386 


39. 


8 


1 


. 7 


231770 


2 


AC109171 




AC109171 


Mus muscu 


1387 


39. 


8 


•1 


.7 


238637 


2 


AC092254 




AC092254 


Mus muscu 


1388 


39. 


8 


1 


. 7 


245515 


2 


AC151275 




AC151275 


Mus muscu 


1389 


39. 


8 


1 


.7 


245795 


2 


AC096419 




AC096419 


Rattus no 


1390 


39. 


8 


1 


.7 


249487 


2 


AC095161 




AC095161 


Rattus no 


1391 


39.. 


8 


1 


. 7 


250178 


2 


AC118121 




AC118121 


Rattus no 


cl392 


39. 


8 


1 


. 7 


253188 


2 


AC098354 




AC098354 


Rattus no 


1393 


39 . 


8 


1 


. 7 


273729 


2 


AC106147 




AC106147 


Rattus no 


cl394 


39. 


8 


1 


. 7 


277363 


2 


AC134745 




AC134745 


Rattus no 


1395 


39. 


8 


1 


. 7 


286451 


2 


AC115130 




AC115130 


Rattus no 


cl396 


39 . 


8 


1 


. 7 


299467 


2 


AC120784 




AC120784 


Mus muscu 


1397 


39. 


6 


1 


.7 


1231 


1 


AY234840 




AY234840 


Staphyloc 


1398 


39. 


6 


1 


.7 


1368 


14 


AF166115 




AF166115 Potato vi 


1399 


39. 


6 


1 


. 7 


1387 


8 


AY117336 




AY117336 Arabidops 


1400 


39. 


6 


1 


.7 


1419 


6 


AX653260 




AX653260 


Sequence 


1401 


39 . 


6 


1 


. 7 


1433 


6 


AX211617 




AX211617 


Sequence 


1402 


39 . 


6 


1 


. 7 


1443 


14 


PVYDNAO 




Z50042 Potato viru 


1403 


39. 


6 


1 


.7 


1443 


14 


PVYDNAl 




Z50041 Potato viru 


1404 


39 . 


6 


1 


. 7 


1443 


14 


PVYDNAONA 




Z50043 Potato viru 


1405 


39. 


6 


1 


. 7 


1596 


8 


AY084880 




AY084880 Arabidops 


1406 


39 . 


6 


1 


. 7 


1689 


8 


AY080716 




AY080716 Arabidops 


1407 


39 . 


6 


1 


.7 


1980 


14 


BSU61154 




U61154 Buzura supp 


cl408 


39 . 


6 


1 


. 7 


5858 


3 


AF350276 




AF350276 Nephila m 


1409 


39 . 


6 


1 


. 7 


6104 


6 


AX346269 




AX346269 


Sequence 


cl410 


39. 


6 


1 


.7 


8004 


6 


CQ588884 




CQ588884 


Sequence 


1411 


39. 


6 


1 


.7 


8033 


3 


PPINMP 




Y13117 Paramecium 


cl412 


39. 


6 


1 


. 7 


8736 


6 


CQ588875 




CQ588875 


Sequence 


1413 


39 . 


6 


1 


. 7 


9958 


3 


AF246689 




AF246689 


Dictyoste 


1414 


39 . 


6 


1 


.7 


12500 


3 


AF482381 




AF482381 


Dictyoste 


1415 


39. 


6 


1 


.7 


15052 


1 


AF055579 




AF055579 


Streptomy 


cl416 


39 . 


6 


1 


.7 


34919 


3 


AF100663 




AF100663 


Caenorhab 


cl417 


39 . 


6 


1 


. 7 


37225 


9 


AC005954 




AC005954 


Homo sapi 


1418 


39 . 


6 


1 


. 7 


49817 


2 


AC013939 




AC013939 


Drosophil 


1419 


39 . 


6 


1 


.7 


50937 


6 


AR159871 




AR159871 


Sequence 


cl420 


39 . 


6 


1 


. 7 


53920 


9 


AC116423 




AC116423 


Homo sapi 


1421 


39 . 


6 


1 


. 7 


71225 


9 


AL358196 




AL358196 


Human DNA 


1422 


39 . 


6 


1 


.7 


74492 


8 


AY350713 




AY350713 


Capsella 


cl423 


39 . 


6 


1 


. 7 


83528 


8 


AP005695 




AP005695 


Oryza sat 


cl424 


39 . 


6 


1 


.7 


85916 


3 


AC117080 




AC117080 


Dictyoste 


C1425 


39. 


6 


1 


.7 


103908 


2 


AC150245 




AC150245 


Medicago 


C1426 


39. 


6 


1 


.7 


128018 


2 


BX890615 




BX890615 


Danio rer 


C1427 


39. 


6 


1 


.7 


138158 


5 


BX005232 




BX005232 


Zebrafish 


1428 


39. 


6 


1 


.7 


139147 


2 


AC006725 




AC006725 


Caenorhab 



C1429 


39. 


6 


1. 


7 


148397 


2 


CR751563 


1430 


39. 


6 


1. 


7 


150876 


2 


AC108761 


1431 


39. 


6 


1. 


7 


154082 


2 


AC108756 


C1432 


39. 


6 


1. 


7 


154195 


2 


AC013641 


C1433 


39. 


6 


1. 


7 


167195 


3 


AC007808 


1434 


39. 


6 


1. 


7 


170425 


5 


BX465867 


1435 


39. 


6 


1. 


7 


179399 


5 


AL929338 


C1436 


39. 


6 


1. 


7 


185672 


2 


BX640474 


1437 


39. 


6 


1. 


7 


187495 


9 


AC026887 


1438 


39. 


6 


1. 


7 


188638 


2 


CR759889 


1439 


39. 


6 


1. 


7 


210700 


2 


AC103382 


1440 


39. 


6 


1. 


7 


212134 


2 


AC024037 


1441 


39. 


6 


1. 


7 


212499 


2 


CR626887 


C1442 


39. 


6 


1. 


7 


215210 


2 


CR749744 


1443 


39. 


6 


1. 


7 


219892 


2 


AC127200 


1444 


39. 


6 


1. 


7 


220817 


2 


AC113854 


1445 


39, 


6 


1. 


7 


226092 


2 


AC107090 


C1446 


39. 


6 


1. 


7 


233220 


2 


AC098058 


1447 


39. 


6 


1. 


7 


254633 


2 


CR762389 


1448 


39. 


6 


1. 


7 


254733 


3 


AC117075 


C1449 


39. 


6 


1. 


7 


254961 


3 


AE003706 


1450 


39. 


6 


1. 


7 


269863 


2 


AC131017 


1451 


39. 


4 


1. 


7 


640 


6 


AX652149 


1452 


39. 


4 


1. 


7 


1437 


6 


AX211602 


1453 


39. 


4 


1. 


7 


1437 


6 


AX651766 


1454 


39. 


4 


1. 


7 


1479 


8 


AB191246 


1455 


39. 


4 


1. 


7 


1918 


8 


AK102481 


C1456 


39. 


4 


1. 


7 


1921 


6 


CQ577878 


C1457 


39.4 


1. 


7 


2055 


8 


AK106646 


C1458 


39. 


4 


1. 


7 


2534 


3 


BT001649 


1459 


39. 


4 


1. 


7 


4141 


6 


CQ577877 


1460 


39. 


4 


1. 


7 


6636 


6 


AX344615 


1461 


39. 


4 


1. 


7 


11155 


6 


AX345507 


C1462 


39. 


4 


1. 


7 


38801 


3 


CEK09B11 


C1463 


39. 


4 


1. 


7 


39329 


9 


AC004091 


C1464 


39. 


4 


1. 


7 


43644 


9 


AC004092 


1465 


39. 


4 


1. 


7 


47108 


6 


AX344507 


C1466 


39. 


4 


1. 


7 


60220 


. 3 


AC115604 


C1467 


39. 


4 


1. 


7 


74630 


9 


AL157903 


C1468 


39. 


4 


1. 


7 


87501 


8 


CR380951 6 


C1469 


39. 


4 


1. 


7 


99699 


2 


AC013845 


C1470 


39. 


4 


1. 


7 


117099 


8 


AP003572 


C1471 


39. 


4 


1. 


7 


127131 


8 


AC146819 


C1472 


39. 


4 


1. 


7 


128223 


8 


AP005643 


C1473 


39. 


4 


1. 


7 


131734 


2 


CR792420 


1474 ^ 


.39. 


4 


1. 


7 


136471 


8 


AP003622 


1475 


39. 


4 


1. 


7 


138850 


8 


AC133341 


1476 


39. 


4 


1. 


7 


140757 


9 


AL157778 


C1477 


39. 


4 


1. 


7 


144022 


5 


BX088594 


C1478 


39. 


4 


1. 


7 


147245 


8 


AP003217 


1479 


39. 


4 


1. 


7 


147640 


2 


AP003542 


1480 


39. 


4 


1. 


7 


155906 


8 


AP005934 


C1481 


39. 


4 


1. 


7 


160285 


2 


CR848001 


1482 


39. 


4 


1. 


7 


168269 


2 


CR352217 


1483 


39. 


4 


1. 


7 


171165 


2 


CR376767 


1484 


39. 


4 


1. 


7 


172854 


3 


AC023706 


C1485 


39. 


4 


1. 


7 


175947 


8 


AP003435 



CR751563 Danio rer 

AC108761 Oryza sat 

AC108756 Oryza sat 

AC013641 Homo sapi 

AC007808 Drosophil 

BX465867 Zebrafish 

AL929338 Zebrafish 

BX640474 Danio rer 

AC026887 Homo sapi 

CR759889 Danio rer 

AC103382 Mus muscu 

AC024037 Homo sapi 

CR626887 Danio rer 

CR749744 Danio rer 

AC127200 Rattus no 

AC113854 Rattus no 

AC107090 Rattus no 

AC098058 Rattus no 

CR762389 Danio rer 

AC117075 Dictyoste 

AE003706 Drosophil 

AC131017 Rattus no 

AX652149 Sequence 

AX211602 Sequence 

AX651766 Sequence 

AB191246 Dianthus 

AK102481 Oryza sat 

CQ577878 Sequence 

AK106646 Oryza sat 

BT001649 Drosophil 

CQ577877 Sequence 

AX344615 Sequence 

AX345507 Sequence 
Z83114 Caenorhabdi 

AC004091 Human Cos 

AC004092 Human Cos 

AX344507 Sequence 

AC115604 Dictyoste 

AL157903 Human DNA 
Continuation (7 of 

AC013845 Drosophil 

AP003572 Oryza sat 

AC146819 Medicago 

AP005643 Oryza sat 

CR792420 Danio rer 

.AP00362'2 Oryza sat 

AC133341 Medicago 

AL157778 Human DNA 

BX088594 Zebrafish 

AP003217 Oryza sat 

AP003542 Oryza sat 

AP005934 Oryza sat 

CR848001 Danio rer 

CR352217 Danio rer 

CR376767 Danio rer 

AC023706 Drosophil 

AP003435 Oryza sat 



1486 


39. 


4 


1. 


7 


177743 


9 


AC105941 


AC105941 Homo sapi 


cl487 


39. 


4 


1. 


7 


183648 


3 


AC117081 


AC117081 Dictyoste 


C1488 


39. 


4 


1. 


7 


186739 


9 


AC072028 


AC072028 Homo sapi 


1489 


39. 


4 


1. 


7 


189441 


2 


AC022989 


AC022989 Homo sapi 


C1490 


39. 


4 


1. 


7 


190930 


2 


BX936353 


BX936353 Danio rer 


1491 


39. 


4 


1. 


7 


191590 


3 


AC023722 


AC023722 Drosophil 


C1492 


39. 


4 


1. 


7 


192634 


2 


AC069466 


AC069466 Mus muscu 


1493 


39. 


4 


1. 


7 


197419 


8 


ATCHRIV41 


AL161541 Arabidops 


1494 


39. 


4 


1. 


7 


198019 


5 


BX469910 


BX469910 Zebrafish 


1495 


39. 


4 


1. 


7 


198946 


10 


AL844548 


AL844548 Mouse DNA 


1496 


39. 


4 


1. 


7 


200252 


8 


ATFCA3 


Z97338 Arabidopsis 


C1497 


39. 


4 


1. 


7 


205225 


2 


CR812896 


CR812896 Danio rer 


1498 


39. 


4 


1. 


7 


216131 


5 


BX005004 


BX005004 Zebrafish 


C1499 


39. 


4 


1. 


7 


224731 


2 


AC105545 


AC105545 Rattus no 


1500 


39. 


4 


1. 


7 


273275 


3 


AE014828 


AE014828 Plasmodiu 



ALIGNMENTS 



RESULT 1 
AX697213 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



AX697213 2320 bp DNA 

Sequence 281 from Patent WO0078961. , 
AX697213 

AX697213.1 GI: 29498151 



TITLE 

JOURNAL 

FEATURES 

source 



ORIGIN 



linear PAT 02-APR-2003 



Homo sapiens (human) 
Homo sapiens 
Eukaryota; Metazoa; 
Mammalia ; Eutheria ; 
1 

Ferrara^N., Stewart, 
Eaton, D.L. , Gao,W.Q. 



Chordata; Craniata; Vertebrata; Euteleostomi ; 
Primates; Catarrhini; Hominidae; Homo. 



T.A., Williams, P.M. , Baker, K.P., Desnoyers , L. , 
, Pan, J., Botstein,D., Fong,S., Goddard,A., 
Godowski, P. J. , Gurney,A.L., Smith, V., Tumas,D., Wood, W. I., 
Grimaldi, C. J. , Hillan,K.J., Paoni,N.F., Roy,M.A. and Watanabe,C. K. 
Secreted and transmembrane polypeptides and nucleic acids encoding 
the same 

Patent: WO 0078961-A 281 28-DEC-2000; 
Genentech Inc. (US) 

Location/Qualifiers 

1. .2320 

/organism="Homo sapiens" 
/mol_type= "una s signed DNA" 
/db xref="taxon:9606" 



Query Match 100.0%; Score 2320; DB 6; 

Best Local Similarity 100.0%; Pred. No. 0; 
Matches 2320; Conservative 0; Mismatches 



Length 2320; 
0; Indels 0; 



Gaps 



0; 



Qy 

Db 



1 AGGGTCCCTTAGCCGGGCGCAGGGCGCGCAGCCCAGGCTGAGATCCGCGGCTTCCGTAGA 60 

I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1 AGGGTCCCTTAGCCGGGCGCAGGGCGCGCAGCCCAGGCTGAGATCCGCGGCTTCCGTAGA 60 



Qy 



61 AGTGAGCATGGCTGGGCAGCGAGTGCTTCTTCTAGTGGGCTTCCTTCTCCCTGGGGTCCT 120 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I 



Db 61 AGTGAGCATGGCTGGGCAGCGAGTGCTTCTTCTAGTGGGCTTCCTTCTCCCTGGGGTCCT 120 

Qy 121 GCTCTCAGAGGCTGCCAAAATCCTGACAATATCTACAGTAGGTGGAAGCCATTATCTACT 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 GCTCTCAGAGGCTGCCAAAATCCTGACAATATCTACAGTAGGTGG7\AGCCATTATCTACT 180 

Qy 181 GATGGACCGGGTTTCTCAGATTCTTCAAGATCACGGTCATAATGTCACCATGCTTAACCA 240 

I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 GATGGACCGGGTTTCTCAGATTCTTCAAGATCACGGTCATAATGTCACCATGCTTAACCA 240 

Qy 241 CAAAAGAGGTCCTTTTATGCCAGATTTTAAAAAGGAAGAAAAATCATATCAAGTTATCAG 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 CAAAAGAGGTCCTTTTATGCCAGATTTTAAAAAGGAAGAAAAATCATATCAAGTTATCAG 300 

Qy 301 TTGGCTTGCACCTGAAGATCATCAAAGAGAATTTAAAAAGAGTTTTGATTTCTTTCTGGA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I i I I I I I I I I I I 
Db 301 TTGGCTTGCACCTGAAGATCATCAAAGAGAATTTAAAAAGAGTTTTGATTTCTTTCTGGA 360 

Qy 361 AGAAACTTTAGGTGGCAGAGGAAAATTTGAAAACTTATTAAATGTTCTAGAATACTTGGC 420 

I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I M I I M M 
Db 361 AGAAACTTTAGGTGGCAGAGGAAAATTTGAAAACTTATTT^TGTTCTAGAATACTTGGC 420 

Qy 421 GTTGCAGTGCAGTCATTTTTTAAATAGAAAGGATATCATGGATTCCTTAAAGAATGAGAA 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I 
Db 421 GTTGCAGTGCAGTCATTTTTTAAATAGAAAGGATATCATGGATTCCTTAAAGAATGAGAA 480 

Qy 481 CTTCGACATGGTGATAGTTGAAACTTTTGACTACTGTCCTTTCCTGATTGCTGAGAAGCT 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I M I I M 

Db 481 CTTCGACATGGTGATAGTTGAAACTTTTGACTACTGTCCTTTCCTGATTGCTGAGAAGCT 540 

Qy 541 TGGGAAGCCATTTGTGGCCATTCTTTCCACTTCATTCGGCTCTTTGGAATTTGGGCTACC 600 

I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 541 TGGGAAGCCATTTGTGGCCATTCTTTCCACTTCATTCGGCTCTTTGGAATTTGGGCTACC 600 

Qy 601 AATCCCCTTGTCTTATGTTCCAGTATTCCGTTCCTTGCTGACTGATCACATGGACTTCTG 660 

I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I 
Db 601 AATCCCCTTGTCTTATGTTCCAGTATTCCGTTCCTTGCTGACTGATCACATGGACTTCTG 660 

Qy 661 GGGCCGAGTGAAGAATTTTCTGATGTTCTTTAGTTTCTGCAGGAGGCAACAGCACATGCA 720 

I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I i I I I I I I I I I I 
Db 661 GGGCCGAGTGAAGAATTTTCTGATGTTCTTTAGTTTCTGCAGGAGGCAACAGCACATGCA 720 

Qy 721 GTCTACATTTGAC7\ACACCATCAAGGAACATTTCACAGAAGGCTCTAGGCCAGTTTTGTC 780 

I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I 
Db 721 GTCTACATTTGACAACACCATCAAGGAACATTTCACAGAAGGCTCTAGGCCAGTTTTGTC 780 

Qy 781 TCATCTTCTACTGAAAGCAGAGTTGTGGTTCATT7VACTCTGACTTTGCCTTTGATTTTGC 840 

I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I t I I 
Db 781 TCATCTTCTACTGAAAGCAGAGTTGTGGTTCATTAACTCTGACTTTGCCTTTGATTTTGC 840 

Qy 841 TCGACCTCTGCTTCCCAACACTGTTTATGTTGGAGGCTTGATGGAAAAACCTATTAAACC 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 
Db 841 TCGACCTCTGCTTCCCAACACTGTTTATGTTGGAGGCTTGATGGAAAAACCTATTAAACC 900 

Qy 901 AGTACCACAAGACTTGGAGT^ACTTCATTGCCAAGTTTGGGGACTCTGGTTTTGTCCTTGT 960 

I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 901 AGTACCACAAGACTTGGAGAACTTCATTGCCAAGTTTGGGGACTCTGGTTTTGTCCTTGT 960 



Qy 961 GACCTTGGGCTCCATGGTGAACACCTGTCAGAATCCGGAAATCTTCAAGGAGATGAACAA 1020 

I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I 

Db 961 GACCTTGGGCTCCATGGTGAACACCTGTCAGAATCCGGAAATCTTCAAGGAGATGAACAA 1020 

Qy 1021 TGCCTTTGCTCACCTACCCCAAGGGGTGATATGGAAGTGTCAGTGTTCTCATTGGCCCAA 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I 

Db 1021 TGCCTTTGCTCACCTACCCCAAGGGGTGATATGGAAGTGTCAGTGTTCTCATTGGCCCAA 1080 

Qy 1081 AGATGTCCACCTGGCTGCAAATGTGAAAATTGTGGACTGGCTTCCTCAGAGTGACCTCCT 1140 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1081 AGATGTCCACCTGGCTGCAAATGTGAAAATTGTGGACTGGCTTCCTCAGAGTGACCTCCT 1140 

Qy 1141 GGCTCACCCAAGCATCCGTCTGTTTGTCACCCACGGCGGGCAGAATAGCATAATGGAGGC 1200 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1141 GGCTCACCCAAGCATCCGTCTGTTTGTCACCCACGGCGGGCAGAATAGCATAATGGAGGC 1200 

Qy 1201 CATCCAGCATGGTGTGCCCATGGTGGGGATCCCTCTCTTTGGAGACCAGCCTGAAAACAT 1260 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1201 CATCCAGCATGGTGTGCCCATGGTGGGGATCCCTCTCTTTGGAGACCAGCCTGAAAACAT 1260 

Qy 1261 GGTCCGAGTAGAAGCCAAAAAGTTTGGTGTTTCTATTCAGTTAAAGAAGCTCAAGGCAGA 1320 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1261 GGTCCGAGTAGAAGCCAAAAAGTTTGGTGTTTCTATTCAGTTAAAGAAGCTCAAGGCAGA 1320 

Qy 1321 GACATTGGCTCTTAAGATGAAACAAATCATGGAAGACAAGAGATACAAGTCCGCGGCAGT 1380 

I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1321 GACATTGGCTCTTAAGATGAAACAAATCATGGAAGACAAGAGATACAAGTCCGCGGCAGT 1380 

Qy 1381 GGCTGCCAGTGTCATCCTGCGCTCCCACCCGCTCAGCCCCACACAGCGGCTGGTGGGCTG 1440 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1381 GGCTGCCAGTGTCATCCTGCGCTCCCACCCGCTCAGCCCCACACAGCGGCTGGTGGGCTG 1440 

Qy 1441 GATTGACCACGTCCTCCAGACAGGGGGCGCGACGCACCTCAAGCCCTATGTCTTTCAGCA 1500 

I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I Ml I I I I I I I I I I I I I I I I I I I I I I 

Db 1441 GATTGACCACGTCCTCCAGACAGGGGGCGCGACGCACCTCAAGCCCTATGTCTTTCAGCA 1500 

Qy 1501 GCCCTGGCATGAGCAGTACCTGTTCGACGTTTTTGTGTTTCTGCTGGGGCTCACTCTGGG 1560 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1501 GCCCTGGCATGAGCAGTACCTGTTCGACGTTTTTGTGTTTCTGCTGGGGCTCACTCTGGG 1560 

Qy 1561 GACTCTATGGCTTTGTGGGAAGCTGCTGGGCATGGCTGTCTGGTGGCTGCGTGGGGCCAG 1620 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1561 GACTCTATGGCTTTGTGGGAAGCTGCTGGGCATGGCTGTCTGGTGGCTGCGTGGGGCCAG 1620 

Qy 1.621 AAAGGTGAAGGAGACATAAGGCCAGGTGCAGCCTTGGCGGGGTCTGT.TT.GGTGGGCGATG 1680 

I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1621 AAAGGTGAAGGAGACATAAGGCCAGGTGCAGCCTTGGCGGGGTCTGTTTGGTGGGCGATG 1680 

Qy 1681 TCACCATTTCTAGGGAGCTTCCCACTAGTTCTGGCAGCCCCATTCTCTAGTCCTTCTAGT 1740 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I 

Db 1681 TCACCATTTCTAGGGAGCTTCCCACTAGTTCTGGCAGCCCCATTCTCTAGTCCTTCTAGT 1740 

Qy 1741 TATCTCCTGTTTTCTTGAAGAACAGGAA7WVTGGCCAAAAATCATCCTTTCCACTTGCTA 1800 

I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1741 TATCTCCTGTTTTCTTGAAGAACAGGAAAAATGGCCAAAAATCATCCTTTCCACTTGCTA 1800 



Qy 1801 ATTTTGCTACAAATTCATCCTTACTAGCTCCTGCCTGCTAGCAGAAATCTTTCCAGTCCT 1860 

I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1801 ATTTTGCTACAAATTCATCCTTACTAGCTCCTGCCTGCTAGCAGAAATCTTTCCAGTCCT 1860 

Qy 1861 CTTGTCCTCCTTTGTTTGCCATCAGCAAGGGCTATGCTGTGATTCTGTCTCTGAGTGACT 1920 

I I I I I I I I I I I I I i I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1861 CTTGTCCTCCTTTGTTTGCCATCAGCAAGGGCTATGCTGTGATTCTGTCTCTGAGTGACT 1920 

Qy 1921 TGGACCACTGACCCTCAGATTTCCAGCCTTAAAATCGACCTTCCTTCTCATGCGCCTCTC 1980 

I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1921 TGGACCACTGACCCTCAGATTTCCAGCCTTAAAATCCACCTTCCTTCTCATGCGCCTCTC 1980 

Qy 1981 CGAATCACACCCTGACTCTTCCAGCCTCCATGTCCAGACCTAGTCAGCCTCTCTCACTCC 2040 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 n I I I I I I I 

Db 1981 CGAATCACACCCTGACTCTTCCAGCCTCCATGTCCAGACCTAGTCAGCCTCTCTCACTCC 2040 

Qy 2041 TGCCCCTACTATCTATCATGGAATAACATCCAAGAAAGACACCTTGCATATTCTTTCAGT 2100 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I 
Db 2041 TGCCCCTACTATCTATCATGGAATAACATCCAAGAAAGACACCTTGCATATTCTTTCAGT 2100 

Qy 2101 TTCTGTTTTGTTCTCCCACATATTCTCTTCAATGCTCAGGAAGCCTGCCCTGTGCTTGAG 2160 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2101 TTCTGTTTTGTTCTCCCACATATTCTCTTCAATGCTCAGGAAGCCTGCCCTGTGCTTGAG 2160 

Qy 2161 AGTTCAGGGCCGGACACAGGCTCACAGGTCTCCACATTGGGTCCCTGTCTCTGGTGCCCA 2220 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2161 AGTTCAGGGCCGGACACAGGCTCACAGGTCTCCACATTGGGTCCCTGTCTCTGGTGCCCA 2220 

Qy 2221 CAGTGAGCTCCTTCTTGGCTGAGCAGGCATGGAGACTGTAGGTTTCCAGATTTCCTGAAA 2280 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2221 CAGTGAGCTCCTTCTTGGCTGAGCAGGCATGGAGACTGTAGGTTTCCAGATTTCCTGAAA 2280 

Qy 2281 AATAAAAGTTTACAGCGTTATCTCTCCCCAACCTCACTAA 2320 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2281 AATAAAAGTTTACAGCGTTATCTCTCCCCAACCTCACTAA 2320 



RESULT 2 
AY358416 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



AY358416 2320 bp mRNA linear PRI 03-OCT-2003 

Homo sapiens clone DNA71169 glucuronosyltransf erase (UNQ842) mRNA, 
complete cds . 
AY358416 

AY358416.1 GI: 37181956 
FLI_CDNA. 

Homo sapiens.-,.(huinan) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 2320) • 

Clark, H.F., Gurney,A.L., Abaya,E., Baker, K., Baldwin, D., Brush, J., 
Chen, J., Chow,B., Chui,C., Crowley, C, Currell,B., Deuel, B., 
Dowd,P., Eaton, D., Foster, J., Grimaldi,C., Gu,Q., Hass,P.E., 
Heldens,S., Huang, A., Kim,H,S., Klimowski, L, , Jin,Y., Johnson, S,, 
Lee, J., Lewis, L., Liao,D., Mark,M., Robbie, E., Sanchez, C, 
Schoenfeld, J. , Seshagiri, S . , Simmons, L., Singh, J., Smith,V., 
Stinson,J., Vagts,A. , Vandlen,R-, Watanabe,C., Wieand,D., Woods, K., 



TITLE 



JOURNAL 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



gene 
CDS 



Xie,.M.H., Yansura^D., Yi,S., Yu,G., Yuan, J., Zhang, M., Zhang, Z., 
Goddard, A. , Wood, W.I. and Godowski,P. 

The Secreted Protein Discovery Initiative (SPDI) , a Large-Scale 

Effort to Identify Novel Human Secreted and Transmembrane Proteins: 

A Bioinformatics Assessment 

Genome Res. 13 (10), 2265-2270 (2003) 

12975309 

2 (bases 1 to 2320) 

Clark, H.F. 

Direct Submission 

Submitted (Ol-AUG-2003) Department of Bioinformatics, Genentech, 
Inc., 1 DNA Way, South San Francisco, CA 94080, USA 

Location/Qualifiers 

1. .2320 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref =" taxon : 9606" 
/clone="DNA71169" 
1. .2320 

/locus_tag="UNQ842" 
68. .1639 

/ 1 ocus_tag= "UNQ 8 42" 

/note="PR01780" 

/codon_start=l 

/ product="glucuronosyltransf erase" 
/protein_id="AAQ88782 . 1" 
/db_xref="GI : 37181957" 

/translation="MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQ 
ILQDHGHNVTMLNHKRGPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLG 
GRGKFENLLNVLEYLALQCSHFLNRKDIMDSLKNENFDMVTVETFDYCPFLIAEKLGK 
PFVAILSTSFGSLEFGLPIPLSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQ 
STFDNTIKEHFTEGSRPVLSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPI 
KPVPQDLENFIAKFGDSGFVLVTLGSMVNTCQNPEIFKEMNNAFAHLPQGVIWKCQCS 
HWPKDVHLAANVKIVDWLPQSDLLAHPSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFG 
DQPENMVRVEAKKFGVSIQLKKLKAETLALKMKQIMEDKRYKSAAVAASVILRSHPLS 
PTQRLVGWIDHVLQTGGATHLKPYVFQQPWHEQYLFDVFVFLLGLTLGTLWLCGKLLG 
MAVWWLRGARKVKET " 



ORIGIN 



Query Match 100.0%; Score 2320; DB 9; Length 2320; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2320; Conservative 0; Mismatches 0; Indels 0; Gaps 



0; 



Qy 



AGGGTCCCTTAGCCGGGCGCAGGGCGCGCAGCCCAGGCTGAGATCCGCGGCTTCCGTAGA 60 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AGGGTCCCTTAGCCGGGCGCAGGGCGCGCAGCCCAGGCTGAGATCCGCGGCTTCCGTAGA 6 0 



Qy 

Db 

Qy 

Db 



61 AGTGAGCATGGCTGGGCAGCGAGTGCTTCTTCTAGTGGGCTTCCTTCTCCCTGGGGTCCT 120 

I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I 

61 AGTGAGCATGGCTGGGCAGCGAGTGCTTCTTCTAGTGGGCTTCCTTCTCCCTGGGGTCCT 120 

121 GCTCTCAGAGGCTGCCAA/^TCCTGACAATATCTACAGTAGGTGGAAGCCATTATCTACT 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

121 GCTCTCAGAGGCTGCCAAAATCCTGACAATATCTACAGTAGGTGGAAGCCATTATCTACT 180 



Qy 



181 GATGGACCGGGTTTCTCAGATTCTTCAAGATCACGGTCATAATGTCACCATGCTTAACCA 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



240 



Db 181 GATGGACCGGGTTTCTCAGATTCTTCAAGATCACGGTCATAATGTCACCATGCTTAACCA 240 

Qy 241 CAAAAGAGGTCCTTTTATGCCAGATTTTAAAAAGGAAGAAAAATCATATCT^GTTATCAG 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 CAAAAGAGGTCCTTTTATGCCAGATTTTAAAAAGGAAGAAAAATCATATCAAGTTATCAG 300 

Qy 301 TTGGCTTGCACCTGAAGATCATCAAAGAGAATTTAAAAAGAGTTTTGATTTCTTTCTGGA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M I M I I I I I I I I I I I I I I I I I I 

Db 301 TTGGCTTGCACCTGAAGATCATCAAAGAGAATTTAAAAAGAGTTTTGATTTCTTTCTGGA 360 

Qy 361 AGAAACTTTAGGTGGCAGAGGAAAATTTGAAAACTTATTAAATGTTCTAGAATACTTGGC 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I 

Db 361 AGAAACTTTAGGTGGCAGAGGAAAATTTGAAAACTTATTAAATGTTCTAGAATACTTGGC 420 

Qy 421 GTTGCAGTGCAGTCATTTTTTAAATAGAAAGGATATCATGGATTCCTTAAAGAATGAGAA 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 421 GTTGCAGTGCAGTCATTTTTTAAATAGAAAGGATATCATGGATTCCTTAAAGAATGAGAA 480 

Qy 481 CTTCGACATGGTGATAGTTG7UVACTTTTGACTACTGTCCTTTCCTGATTGCTGAGAAGCT 540 

I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 481 CTTCGACATGGTGATAGTTGAAACTTTTGACTACTGTCCTTTCCTGATTGCTGAGAAGCT 540 

Qy 541 TGGGAAGCCATTTGTGGCCATTCTTTCCACTTCATTCGGCTCTTTGGAATTTGGGCTACC 600 

I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 

Db 541 TGGGAAGCCATTTGTGGCCATTCTTTCCACTTCATTCGGCTCTTTGGAATTTGGGCTACC 600 

Qy 601 AATCCCCTTGTCTTATGTTCCAGTATTCCGTTCCTTGCTGACTGATCACATGGACTTCTG 660 

I I I I I I I I I I I I I M I I I I I I I I I M I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 601 AATCCCCTTGTCTTATGTTCCAGTATTCCGTTCCTTGCTGACTGATCACATGGACTTCTG 660 

Qy 661 GGGCCGAGTGAAGAATTTTCTGATGTTCTTTAGTTTCTGCAGGAGGCAACAGCACATGCA 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I 

Db 661 GGGCCGAGTGAAGAATTTTCTGATGTTCTTTAGTTTCTGCAGGAGGC/SkACAGCACATGCA 720 

Qy 721 GTCTACATTTGACAACACCATCAAGGAACATTTCACAGAAGGCTCTAGGCCAGTTTTGTC 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 721 GTCTACATTTGACAACACCATCAAGGAACATTTCACAGAAGGCTCTAGGCCAGTTTTGTC 780 

Qy 781 TCATCTTCTACTGAAAGCAGAGTTGTGGTTCATTAACTCTGACTTTGCCTTTGATTTTGC 840 

I I I I I I I I I I I I I I I I I I I I I I M I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I 

Db 781 TCATCTTCTACTGAAAGCAGAGTTGTGGTTCATTAACTCTGACTTTGCCTTTGATTTTGC 840 

Qy 841 TCGACCTCTGCTTCCCAACACTGTTTATGTTGGAGGCTTGATGGAAAAACCTATTAAACC 900 

I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I M I I I I I I I I I I I I M I I I 

Db 841 TCGACCTCTGCTTCCCAACACTGTTTATGTTGGAGGCTTGATGGAAAAACCTATTAAACC 900 

Qy 901 AGTACCACAAGACTTGGAGAACTTCATTGCCAAGTTTGGGGACTCTGGTTTTGTCCTTGT 960 

I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 901 AGTACCACAAGACTTGGAGAACTTCATTGCCAAGTTTGGGGACTCTGGTTTTGTCCTTGT 960 

Qy 961 GACCTTGGGCTCCATGGTGAACACCTGTCAGAATCCGGAAATCTTCAAGGAGATGAACAA 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 

Db 961 GACCTTGGGCTCCATGGTGAACACCTGTCAGAATCCGGAAATCTTCAAGGAGATGAACAA 1020 



Qy 1021 TGCCTTTGCTCACCTACCCCAAGGGGTGATATGGAAGTGTCAGTGTTCTCATTGGCCCAA 1080 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1021 TGCCTTTGCTCACCTACCCCAAGGGGTGATATGGAAGTGTCAGTGTTCTCATTGGCCCAA 1080 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



1081 AGATGTCCACCTGGCTGCAAATGTGAAAATTGTGGACTGGCTTCCTCAGAGTGACCTCCT 1140 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I i I I I I I I I I I I I I I I I I I I 
1081 AGATGTCCACCTGGCTGCAAATGTGAAAATTGTGGACTGGCTTCCTCAGAGTGACCTCCT 1140 

1141 GGCTCACCCAAGCATCCGTCTGTTTGTCACCCACGGCGGGCAGAATAGCATAATGGAGGC 1200 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1141 GGCTCACCCAAGCATCCGTCTGTTTGTCACCCACGGCGGGCAGAATAGCATAATGGAGGC 1200 

1201 CATCCAGCATGGTGTGCCCATGGTGGGGATCCCTCTCTTTGGAGACCAGCCTGTVAAACAT 1260 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1201 CATCCAGCATGGTGTGCCCATGGTGGGGATCCCTCTCTTTGGAGACCAGCCTGAAAACAT 1260 

1261 GGTCCGAGTAGAAGCCAAAAAGTTTGGTGTTTCTATTCAGTTAAAGAAGCTCAAGGCAGA 1320 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I i I I I I 
GGTCCGAGTAGAAGCCAAAAAGTTTGGTGTTTCTATTCAGTTAAAGAAGCTCAAGGCAGA 1320 



1261 



1321 



1321 



GACATTGGCTCTTAAGATGAAACAAATCATGGAAGACAAGAGATACAAGTCCGCGGCAGT 1380 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GACATTGGCTCTTAAGATGAAACAAATCATGGAAGACAAGAGATACAAGTCCGCGGCAGT 1380 



1381 GGCTGCCAGTGTCATCCTGCGCTCCCACCCGCTCAGCCCCACACAGCGGCTGGTGGGCTG 1440 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1381 GGCTGCCAGTGTCATCCTGCGCTCCCACCCGCTCAGCCCCACACAGCGGCTGGTGGGCTG 1440 

1441 GATTGACCACGTCCTCCAGACAGGGGGCGCGACGCACCTCAAGCCCTATGTCTTTCAGCA 1500 

I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I 
1441 GATTGACCACGTCCTCCAGACAGGGGGCGCGACGCACCTCAAGCCCTATGTCTTTCAGCA 1500 

1501 GCCCTGGCATGAGCAGTACCTGTTCGACGTTTTTGTGTTTCTGCTGGGGCTCACTCTGGG 1560 

I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1501 GCCCTGGCATGAGCAGTACCTGTTCGACGTTTTTGTGTTTCTGCTGGGGCTCACTCTGGG 1560 

1561 GACTCTATGGCTTTGTGGGAAGCTGCTGGGCATGGCTGTCTGGTGGCTGCGTGGGGCCAG 1620 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1561 GACTCTATGGCTTTGTGGGAAGCTGCTGGGCATGGCTGTCTGGTGGCTGCGTGGGGCCAG 1620 

1621 AAAGGTGAAGGAGACATAAGGCCAGGTGCAGCCTTGGCGGGGTCTGTTTGGTGGGCGATG 1680 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I 
1621 AAAGGTGAAGGAGACATAAGGCCAGGTGCAGCCTTGGCGGGGTCTGTTTGGTGGGCGATG 1680 

1681 TCACCATTTCTAGGGAGCTTCCCACTAGTTCTGGCAGCCCCATTCTCTAGTCCTTCTAGT 1740 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I n I I I I 

1681 TCACCATTTCTAGGGAGCTTCCCACTAGTTCTGGCAGCCCCATTCTCTAGTCCTTCTAGT 1740 
1741 TATCTCCTGTTTTCTTGAAGAACAGGAAAAATGGCCAAAAATCATCCTTTCCACTTGCTA 1800 

I II I II I I I I II I I I I II II I I I I II I I I II II M I I II II I II I I II I I II II I I I I II 

1741 TATCTCCTGTTTTCTTGAAGAACAGGAAAAATGGCCAAAAATCATCCTTTCCACTTGCTA 1800 

1801 ATTTTGCTACAAATTCATCCTTACTAGCTCCTGCCTGCTAGCAGAAATCTTTCCAGTCCT 1860 

II I II I I I I I II I I II II II II I I I I I II I II I II I I I I I II I I II II I II I I I I I I II I 
1801 ATTTTGCTACAAATTCATCCTTACTAGCTCCTGCCTGCTAGCAGAAATCTTTCCAGTCCT 1860 

1861 CTTGTCCTCCTTTGTTTGCCATCAGCAAGGGCTATGCTGTGATTCTGTCTCTGAGTGACT 1920 

I I I I I M I M I I II I I II II I I I II I I I I I II I II I I I I II I II M II I II I I M I II I I 

1861 CTTGTCCTCCTTTGTTTGCCATCAGCAAGGGCTATGCTGTGATTCTGTCTCTGAGTGACT 1920 



Qy 

Db 



1921 TGGACCACTGACCCTCAGATTTCCAGCCTTAAAATCCACCTTCCTTCTCATGCGCCTCTC 1980 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1921 TGGACCACTGACCCTCAGATTTCCAGCCTTAAAATCCACCTTCCTTCTCATGCGCCTCTC 1980 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



1981 CGAATCACACCCTGACTCTTCCAGCCTCCATGTCCAGACCTAGTCAGCCTCTCTCACTCC 2040 

I I I I I I I I I M I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I 
1981 CGAATCACACCCTGACTCTTCCAGCCTCCATGTCCAGACCTAGTCAGCCTCTCTCACTCC 2040 

2041 TGCCCCTACTATCTATCATGGAATAACATCCAAGAAAGACACCTTGCATATTCTTTCAGT 2100 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I 
2041 TGCCCCTACTATCTATCATGGAATAACATCCAAGAAAGACACCTTGCATATTCTTTCAGT 2100 

2101 TTCTGTTTTGTTCTCCCACATATTCTCTTCAATGCTCAGGAAGCCTGCCCTGTGCTTGAG 2160 

I I I I I M I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2101 TTCTGTTTTGTTCTCCCACATATTCTCTTCAATGCTCAGGAAGCCTGCCCTGTGCTTGAG 2160 

2161 AGTTCAGGGCCGGACACAGGCTCACAGGTCTCCACATTGGGTCCCTGTCTCTGGTGCCCA 2220 

I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I 
2161 AGTTCAGGGCCGGACACAGGCTCACAGGTCTCCACATTGGGTCCCTGTCTCTGGTGCCCA' 2220 

2221 CAGTGAGCTCCTTCTTGGCTGAGCAGGCATGGAGACTGTAGGTTTCCAGATTTCCTGAAA 2280 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2221 CAGTGAGCTCCTTCTTGGCTGAGCAGGCATGGAGACTGTAGGTTTCCAGATTTCCTGAAA 2280 

2281 AATAAAAGTTTACAGCGTTATCTCTCCCCAACCTCACTAA 2320 

I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2281 AATAAAAGTTTACAGCGTTATCTCTCCCCAACCTCACTAA 2320 
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KEYWORDS 
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AX136141 2341 bp DNA linear PAT 30-MAY-2001 

Sequence 63 from Patent EP1067182. 

AX136141 

AX136141.1 GI: 14272549 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 

Ota,T., Isogai^T., Nishikawa, T . , Kawai,Y., Sugiyama,T. and 
Hayashi^K. 

Secretory protein or membrane protein 

Patent: EP 1067182-A 63 lO-JAN-2001; 

Helix Research Institute (JP) ...... 

Location/Qualifiers 

1. .2341 

/organism="Homo sapiens" 
/mol_type="unas signed ,DNA" 
/db_xref ="taxon : 9606" 
94. .1665 

/note="unnamed protein product" 
/codon_start=l 
/protein_id="CAC39713. 1" 
/db_xref ="GI : 14272550" 

/translation="MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQ 



ILQDHGHNVTMLNHKRGPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLG 

GRGKFENLLNVLEYLALQCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGK 

PFVAILSTSFGSLEFGLPIPLSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQ 

STFDNTIKEHFTEGSRPVLSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPI 

KPVPQDLENFIAKFEDSGFVLVTLGSMVNTCQNPEIFKEMNNAFAHLPQGVIWKCQCS 

HWPKDVHIAANVKIVDWLPQSDLLAHPSIRLFVTHGGQNSIMEAIQHGVP^^ 

DQPENMVRVEAKKFGVSIQLKKLKAETIALKMKQIMEDKRYKSAAVAASVILRSHPLS 

PTQRLVGWIDHVLQTGGATHLKPYVFQQPWHEQYLLDVFVFLLGLTLGTLWLCGKLLG 

MAVWWLRGARKVKET " 

ORIGIN 

Query Match 99.6%; Score 2310.2; DB 6; Length 2341; 

Best Local Similarity 99.9%; Pred. No. 0; 

Matches 2312; Conservative 0; Mismatches 3; Indels 0; Gaps 0; 
Qy 1 AGGGTCCCTTAGCCGGGCGCAGGGCGCGCAGCCCAGGCTGAGATCCGCGGCTTCCGTAGA 60 

I I I I I I I M I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 27 AGGGTCCCTTAGCCGGGCGCAGGGCGCGCAGCCCAGGCTGAGATCCGCGGCTTCCGTAGA 86 

Qy 61 AGTGAGCATGGCTGGGCAGCGAGTGCTTCTTCTAGTGGGCTTCCTTCTCCCTGGGGTCCT 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 
Db 87 AGTGAGCATGGCTGGGCAGCGAGTGCTTCTTCTAGTGGGCTTCCTTCTCCCTGGGGTCCT 146 

Qy 121 GCTCTCAGAGGCTGCCAAAATCCTGACAATATCTACAGTAGGTGGAAGCCATTATCTACT 180 

I I M I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I 
Db 147 GCTCTCAGAGGCTGCCAAAATCCTGACAATATCTACAGTAGGTGGAAGCCATTATCTACT 206 

Qy 181 GATGGACCGGGTTTCTCAGATTCTTCAAGATCACGGTCATAATGTCACCATGCTTAACCA 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I 
Db 207 GATGGACCGGGTTTCTCAGATTCTTCAAGATCACGGTCAT7\ATGTCACCATGCTTAACCA 266 

Qy 241 CAAAAGAGGTCCTTTTATGCCAGATTTTAAAAAGGAAGAAAAATCATATCAAGTTATCAG 300 

I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 267 CAAAAGAGGTCCTTTTATGCCAGATTTTAAAAAGGAAGAAAAATCATATCAAGTTATCAG 326 

Qy 301 TTGGCTTGCACCTGAAGATCATCAAAGAGAATTTAAAAAGAGTTTTGATTTCTTTCTGGA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 11 I I I I I I I I 
Db 327 TTGGCTTGCACCTGAAGATCATCAAAGAGAATTTAAAAAGAGTTTTGATTTCTTTCTGGA 386 

Qy 361 AGAAACTTTAGGTGGCAGAGGAAAATTTGAAAACTTATTAAATGTTCTAGAATACTTGGC 420 

I I I I I I I I I I I I I I I I I I I I I III I I I M I I I I I I I I I I I M I II I M I I I I II II I I I I 

Db 387 AGAAACTTTAGGTGGCAGAGGAAAATTTGAAAACTTATTA7\ATGTTCTAGAATACTTGGC 446 

Qy 421 GTTGCAGTGCAGTCATTTTTTAAATAGAAAGGATATCATGGATTCCTTAAAGAATGAGAA 480 

II I I II I II I I I I II 11 I I I I II II II I II I I II I I II I II I II II I I I II I II II I II I 

Db 447 GTTGCAGTGCAGTCATTTTTTAAATAGAAAGGATATCATGGATTCCTTAAAGAATjGAGAA 506 

Qy 481 CTTCGACATGGTGATAGTTGAAACTTTTGACTACTGTCCTTTCCTGATTGCTGAGAAGCT 54 0 

I I I I I I I I I I I II I I I I I M I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 507 CTTCGACATGGTGATAGTTGAAACTTTTGACTACTGTCCTTTCCTGATTGCTGAGAAGCT 566 

Qy 541 TGGGAAGCCATTTGTGGCCATTCTTTCCACTTCATTCGGCTCTTTGGAATTTGGGCTACC 600 

1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 II 1 1 1 II II 1 1 1 1 1 1 1 1 11 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 

Db 567 TGGGAAGCCATTTGTGGCCATTCTTTCCACTTCATTCGGCTCTTTGGAATTTGGGCTACC 626 



Qy 



601 



AATCCCCTTGTCTTATGTTCCAGTATTCCGTTCCTTGCTGACTGATCACATGGACTTCTG 

I II 11 I II I I II I II I II I I II I II M II I II II I II II II II II II I II I II II I II II 



660 



627 AATCCCCTTGTCTTATGTTCCAGTATTCCGTTCCTTGCTGACTGATCACATGGACTTCTG 686 

661 GGGCCGAGTGAAGAATTTTCTGATGTTCTTTAGTTTCTGCAGGAGGCT^ACAGCACATGCA 720 
I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

687 GGGCCGAGTGAAGAATTTTCTGATGTTCTTTAGTTTCTGCAGGAGGCAACAGCACATGCA 746 

721 GTCTACATTTGACAACACCATCAAGGAACATTTCACAGAAGGCTCTAGGCCAGTTTTGTC 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
747 GTCTACATTTGACAACACCATCAAGGAACATTTCACAGAAGGCTCTAGGCCAGTTTTGTC 806 

781 TCATCTTCTACTGAAAGCAGAGTTGTGGTTCATTAACTCTGACTTTGCCTTTGATTTTGC 840 
I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

807 TCATCTTCTACTG7\AAGCAGAGTTGTGGTTCATTAACTCTGACTTTGCCTTTGATTTTGC 866 

841 TCGACCTCTGCTTCCCAACACTGTTTATGTTGGAGGCTTGATGGAAAAACCTATTAAACC 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I 
867 TCGACCTCTGCTTCCCAACACTGTTTATGTTGGAGGCTTGATGGAAAAACCTATTAAACC 926 

901 AGTACCACAAGACTTGGAGAACTTCATTGCCAAGTTTGGGGACTCTGGTTTTGTCCTTGT 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
927 AGTACCACAAGACTTGGAGAACTTCATTGCCAAGTTTGAGGACTCTGGTTTTGTCCTTGT 986 

961 GACCTTGGGCTCCATGGTGAACACCTGTCAGAATCCGGAAATCTTCAAGGAGATGAACAA 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
987 GACCTTGGGCTCCATGGTGAACACCTGTCAGAATCCGGAAATCTTCAAGGAGATGAACAA 1046 

1021 TGCCTTTGCTCACCTACCCCAAGGGGTGATATGGAAGTGTCAGTGTTCTCATTGGCCCAA 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I 
1047 TGCCTTTGCTCACCTACCCCAAGGGGTGATATGGAAGTGTCAGTGTTCTCATTGGCCCAA 1106 

1081 AGATGTCCACCTGGCTGCAAATGTGAAAATTGTGGACTGGCTTCCTCAGAGTGACCTCCT 1140 

I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I 
1107 AGATGTCCACCTGGCTGCAAATGTGAAAATTGTGGACTGGCTTCCTCAGAGTGACCTCCT 1166 

1141 GGCTCACCCAAGCATCCGTCTGTTTGTCACCCACGGCGGGCAGAATAGCATAATGGAGGC 1200 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I 

1167 GGCTCACCCAAGCATCCGTCTGTTTGTCACCCACGGCGGGCAGAATAGCATAATGGAGGC 1226 

1201 CATCCAGCATGGTGTGCCCATGGTGGGGATCCCTCTCTTTGGAGACCAGCCTGAAAACAT 1260 

I I I I I J I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1227 CATCCAGCATGGTGTGCCCATGGTGGGGATCCCTCTCTTTGGAGACCAGCCTGAAAACAT 1286 

1261 GGTCCGAGTAGAAGCCAAAAAGTTTGGTGTTTCTATTCAGTTAAAGAAGCTCAAGGCAGA 1320 

I I I I I I I I I I I I I I I I I I M I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1287 GGTCCGAGTAGAAGCCAAAAAGTTTGGTGTTTCTATTCAGTTAAAGAAGCTCAAGGCAGA 1346 

1321 GACATTGGCTCTTAAGATGAAACAAATCATGGAAGACAAGAGATACAAGTCCGCGGCAGT 1380 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1347 GACATTGGCTCTTAAGATGAAACAAATCATGGAAGACAAGAGATACAAGTCCGCGGCAGT 1406 

1381 GGCTGCCAGTGTCATCCTGCGCTCCCACCCGCTCAGCCCCACACAGCGGCTGGTGGGCTG 1440 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I M 
1407 GGCTGCCAGTGTCATCCTGCGCTCCCACCCGCTCAGCCCCACACAGCGGCTGGTGGGCTG 1466 



1441 GATTGACCACGTCCTCCAGACAGGGGGCGCGACGCACCTCAAGCCCTATGTCTTTCAGCA 1500 

I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1467 GATTGACCACGTCCTCCAGACAGGGGGCGCGACGCACCTCAAGCCCTATGTCTTTCAGCA 1526 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



1501 GCCCTGGCATGAGCAGTACCTGTTCGACGTTTTTGTGTTTCTGCTGGGGCTCACTCTGGG 1560 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I 
1527 GCCCTGGCATGAGCAGTACCTGCTCGACGTTTTTGTGTTTCTGCTGGGGCTCACTCTGGG 1586 

1561 GACTCTATGGCTTTGTGGGAAGCTGCTGGGCATGGCTGTCTGGTGGCTGCGTGGGGCCAG 1620 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1587 GACTCTATGGCTTTGTGGGAAGCTGCTGGGCATGGCTGTCTGGTGGCTGCGTGGGGCCAG 164 6 

1621 AAAGGTGAAGGAGACATAAGGCCAGGTGCAGCCTTGGCGGGGTCTGTTTGGTGGGCGATG 1680 

I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I U I I I I I I I I I I I I I I I I I I I I I 
1647 AAAGGTGAAGGAGACATAAGGCCAGGTGCAGCCTTGGCGGGGTCTGTTTGGTGGGCGATG 1706 

1681 TCACCATTTCTAGGGAGCTTCCCACTAGTTCTGGCAGCCCCATTCTCTAGTCCTTCTAGT 1740 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1707 TCACCATTTCTAGGGAGCTTCCCACTAGTTCTGGCAGCCCCATTCTCTAGTCCTTCTAGT 1766 

1741 TATCTCCTGTTTTCTTGAAGAACAGGAAAAATGGCCAAAAATCATCCTTTCCACTTGCTA 1800 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1767 TATCTCCTGTTTTCTTGAAGAACAGGAAAAATGGCCAAAAATCATCCTTTCCACTTGCTA 



1826 



1801 



ATTTTGCTACAAATTCATCCTTACTAGCTCCTGCCTGCTAGCAGAAATCTTTCCAGTCCT 1860 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I 
1827 ATTTTGCTACAAATTCATCCTTACTAGCTCCTGCCTGCTAGCAGAATTCTTTCCAGTCCT 1886 

1861 CTTGTCCTCCTTTGTTTGCCATCAGCAAGGGCTATGCTGTGATTCTGTCTCTGAGTGACT 1920 

I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1887 CTTGTCCTCCTTTGTTTGCCATCAGCAAGGGCTATGCTGTGATTCTGTCTCTGAGTGACT 1946 

1921 TGGACCACTGACCCTCAGATTTCCAGCCTTAAAATCCACCTTCCTTCTCATGCGCCTCTC 1980 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 
1947 TGGACCACTGACCCTCAGATTTCCAGCCTTAAAATCCACCTTCCTTCTCATGCGCCTCTC 2006 

1981 CGAATCACACCCTGACTCTTCCAGCCTCCATGTCCAGACCTAGTCAGCCTCTCTCACTCC 2040 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 11 I I I I I I I I I I I I I I I I I I I I I 
2007 CGAATCACACCCTGACTCTTCCAGCCTCCATGTCCAGACCTAGTCAGCCTCTCTCACTCC 2066 

2041 TGCCCGTACTATCTATCATGGAATAACATCCAAGAAAGACACCTTGCATATTCTTTCAGT 2100 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I 

2067 TGCCCCTACTATCTATCATGGAATAACATCCAAGAAAGACACCTTGCATATTCTTTCAGT 2126 

2101 TTCTGTTTTGTTCTCCCACATATTCTCTTCAATGCTCAGGAAGCCTGCCCTGTGCTTGAG 2160 

.1 I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2127 TTCTGTTTTGTTCTCCCACATATTCTCTTCAATGCTCAGGAAGCCTGCCCTGTGCTTGAG 2186 



2161 



2220 



AGTTCAGGGCCGGACACAGGCT.CACAGGTCTCCACATTGGGTCCCTGTCTCTGGTGCCCA 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I 

2187 AGTTCAGGGCCGGACACAGGCTCACAGGTCTCCACATTGGGTCCCTGTCTCTGGTGCCCA 224 6 

2221 CAGTGAGCTCCTTCTTGGCTGAGCAGGCATGGAGACTGTAGGTTTCCAGATTTCCTGAAA 2280 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CAGTGAGCTCCTTCTTGGCTGAGCAGGCATGGAGACTGTAGGTTTCCAGATTTCCTGAAA 



2247 



2306 



2315 



2281 AATAAAAGTTTACAGCGTTATCTCTCCCCAACCTC 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2307 AATAAAAGTTTACAGCGTTATCTCTCCCCAACCTC 2341 



RESULT 4 
BD123523 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 
JOURNAL 

COMMENT 



BD123523 2341 bp DNA 

Secretory protein or membrane protein. 
BD123523 

BD123523.1 GI:23218468 
JP 2002017376-A/32. 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 2341) 
Ota^T., Isogai,.T., Nishikawa^T 
Hayashi, K. 

Secretory protein or membrane protein 
Patent: JP 2002017376-A 32 22-JAN-2002; 
HELIX RESEARCH INSTITUTE 



linear PAT 18-SEP-2002 



Craniata; Vertebrata; Euteleostomi; 
Catarrhini; Hominidae; Homo. 

Kawai^Y., Sugiyama^T. and 



Homo sapiens (human) 
JP 2002017376-A/32 
22~JAN-2002 

07-JUL-2000 JP 2000253173 

TOSHIO OTA.TAKAO ISOGAI , TETSUO NI SHI KAWA, YURI KAWTVI , TOMOYASU 

SUGIYAMA, 

KOJI HAYASHI 



OS 
PN 
PD 
PF 
PI 
PI 
PI 
PC 

C12N15/09,C07K14/47,C07K16/18^C12N1/15,C12N1/19,C12N1/21,C12N5/ PC 
10, 

PC C12P21/02,C12Ql/68//C12P21/08,C12N15/00,C12N5/00 CC 
Secretory protein or membrane protein 
FH Key 
FT CDS 



FEATURES 

source 



ORIGIN 



Location/Qualifiers 
(94). .(1662). 

Location/Qualifiers 

1. .2341 

/organism="Homo sapiens" 
/mol_type="genomic DNA" 
/db xref="taxon:9606" 



Query Match 99.6%; Score 2310.2; 

Best Local Similarity 99.9%; Pred. No. 0; 
Matches 2312; Conservative 0; Mismatches 



DB 6; Length 2341; 



3; Indels 



Qy 

Db 



0 ; Gaps 



0; 



1 AGGGTCCCTTAGCCGGGCGCAGGGCGCGCAGCCCAGGCTGAGATCCGCGGCTTCCGTAGA 60 
I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
27 AGGGTCCCTTAGCCGGGCGCAGGGCGCGCAGCCCAGGCTGAGATCCGCGGCTTCCGTAGA 86 



Qy 

Db 

Qy 

Db 



61 AGTGAGCATGGCTGGGCAGCGAGTGCTTCTTCTAGTGGGCTTCCTTCTCCCTGGGGTCCT 120 

I I I I I I I I I M I I I I I I I I I M I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
87 AGTGAGCATGGCTGGGCAGCGAGTGCTTCTTCTAGTGGGCTTCCTTCTCCCTGGGGTCCT 146 



121 



180 



GCTCTCAGAGGCTGCCAAAATCCTGACAATATCTACAGTAGGTGGAAGCCATTATCTACT 
I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
147 GCTCTCAGAGGCTGCCAAAATCCTGACAATATCTACAGTAGGTGGAAGCCATTATCTACT 206 



Qy 



181 GATGGACCGGGTTTCTCAGATTCTTCAAGATCACGGTCATAATGTCACCATGCTTAACCA 240 
I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Db 



207 GATGGACCGGGTTTCTCAGATTCTTCAAGATCACGGTCATAATGTCACCATGCTTAACCA 266 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db. 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



241 CAAAAGAGGTCCTTTTATGCCAGATTTTAAAAAGGAAGAAAAATCATATCAAGTTATCAG 300 

I I I I I I I I I I t I I I I I I I I I I I I M I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
267 CAAAAGAGGTCCTTTTATGCCAGATTTTAAAAAGGAAGAAAAATCATATCAAGTTATCAG 326 

301 TTGGCTTGCACCTGAAGATCATCAAAGAGAATTTAAAAAGAGTTTTGATTTCTTTCTGGA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
327 TTGGCTTGCACCTGAAGATCATCAAAGAGAATTTAAAAAGAGTTTTGATTTCTTTCTGGA 386 



361 



387 



421 



447 



481 



507 



541 



567 



601 



627 



661 



687 



721 



747 



781 



807 



841 



867 



901 



927 



AGAAACTTTAGGTGGCAGAGGAAAATTTGAAAACTTATTAAATGTTCTAGAATACTTGGC 420 
I I I I I I I I I I I I.I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I 

AGAAACTTTAGGTGGCAGAGGAAAATTTGAAAACTTATTAAATGTTCTAGAATACTTGGC 446 

GTTGCAGTGCAGTCATTTTTTAAATAGAAAGGATATCATGGATTCCTTAAAGAATGAGAA 480 
I I I II I I I I I I I II II I I I I I I I II I II I II I I I I I I I II M II II I I I II I I II II I I I 
GTTGCAGTGCAGTCATTTTTTAAATAGA7\AGGATATCATGGATTCCTTAAAGAATGAGAA 506 

CTTCGACATGGTGATAGTTGAAACTTTTGACTACTGTCCTTTCCTGATTGCTGAGAAGCT 540 

M M I I I I I I I I I I I M I I II I I I M I I I I I M I I I II I I II I I I I I M I I I I I I I II M 

CTTCGACATGGTGATAGTTGAAACTTTTGACTACTGTCCTTTCCTGATTGCTGAGAAGCT 566 

TGGGAAGCCATTTGTGGCCATTCTTTCCACTTCATTCGGCTCTTTGGAATTTGGGCTACC 600 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
TGGGAAGCCATTTGTGGCCATTCTTTCCACTTCATTCGGCTCTTTGGAATTTGGGCTACC 626 

AATCCCCTTGTCTTATGTTCCAGTATTCCGTTCCTTGCTGACTGATCACATGGACTTCTG 660 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I 
AATCCCCTTGTCTTATGTTCCAGTATTCCGTTCCTTGCTGACTGATCACATGGACTTCTG 686 

GGGCCGAGTGAAGAATTTTCTGATGTTCTTTAGTTTCTGCAGGAGGCAACAGCACATGCA 720 
I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GGGCCGAGTGAAGAATTTTCTGATGTTCTTTAGTTTCTGCAGGAGGCAACAGCACATGCA 746 

GTCTACATTTGACAACACCATCAAGGAACATTTCACAGAAGGCTCTAGGCCAGTTTTGTC 780 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GTCTACATTTGACAACACCATCAAGGAACATTTCACAGAAGGCTCTAGGCCAGTTTTGTC 806 

TCATCTTCTACTGAAAGCAGAGTTGTGGTTCATTAACTCTGACTTTGCCTTTGATTTTGC 840 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
TCATCTTCTACTGAAAGCAGAGTTGTGGTTCATTAACTCTGACTTTGCCTTTGATTTTGC 866 

TCGACCTCTGCTTCCCAACACTGTTTATGTTGGAGGCTTGATGGAAAAACCTATTAAACC 900 
I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I 
TCGACCTCTGCTTCCCAACACTGTTTATGTTGGAGGCTTGATGGAAAAACCTATTAAACC 926 

AGTACCACAAGACTTGGAGAACTTCATTGCCAAGTTTGGGGACTCTGGTTTTGTCCTTGT 960 
I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I 
AGTACCACAAGACTTGGAGAACTTCATTGCCAAGTTTGAGGACTCTGGTTTTGTCCTTGT 986 



1020 



961 GACCTTGGGCTCCATGGTGAACACCTGTCAGAATCCGGAAATCTTCAAGGAGATGAACAA 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I II I II I 
987 GACCTTGGGCTCCATGGTGAACACCTGTCAGAATCCGGAAATCTTCAAGGAGATGAACAA 1046 

1021 TGCCTTTGCTCACCTACCCCAAGGGGTGATATGGAAGTGTCAGTGTTCTCATTGGCCCAA 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
1047 TGCCTTTGCTCACCTACCCCAAGGGGTGATATGG7VAGTGTCAGTGTTCTCATTGGCCCAA 1106 



Qy 1081 AGATGTCCACCTGGCTGCAAATGTGAAAATTGTGGACTGGCTTCCTCAGAGTGACCTCCT 1140 

M I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I M I I I I I I 
Db 1107 AGATGTCCACCTGGCTGCAAATGTGAAAATTGTGGACTGGCTTCCTCAGAGTGACCTCCT 1166 

Qy 1141 GGCTCACCCAAGCATCCGTCTGTTTGTCACCCACGGCGGGCAGAATAGCATAATGGAGGC 1200 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I 
Db 1167 GGCTCACCCAAGCATCCGTCTGTTTGTCACCCACGGCGGGCAGAATAGCATAATGGAGGC 1226 

Qy 1201 CATCCAGCATGGTGTGCCCATGGTGGGGATCCCTCTCTTTGGAGACCAGCCTGAAAACAT 1260 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 
Db 1227 CATCCAGCATGGTGTGCCCATGGTGGGGATCCCTCTCTTTGGAGACCAGCCTGAAAACAT 1286 

Qy 1261 GGTCCGAGTAGAAGCCAAAAAGTTTGGTGTTTCTATTCAGTTAAAGAAGCTCAAGGCAGA 1320 

i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1287 GGTCCGAGTAGAAGCCAAAAAGTTTGGTGTTTCTATTCAGTTAAAGAAGCTCAAGGCAGA 134 6 

Qy 1321 GACATTGGCTCTTAAGATGAAACAAATCATGGAAGACAAGAGATACAAGTCCGCGGCAGT 1380 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1347 GACATTGGCTCTTAAGATGAAACAAATCATGGAAGACAAGAGATACAAGTCCGCGGCAGT 1406 

Qy 1381 GGCTGCCAGTGTCATCCTGCGCTCCCACCCGCTCAGCCCCACACAGCGGCTGGTGGGCTG 1440 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1407 GGCTGCCAGTGTCATCCTGCGCTCCCACCCGCTCAGCCCCACACAGCGGCTGGTGGGCTG 14 66 

Qy 1441 GATTGACCACGTCCTCCAGACAGGGGGCGCGACGCACCTCAAGCCCTATGTCTTTCAGCA 1500 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1467 GATTGACCACGTCCTCCAGACAGGGGGCGCGACGCACCTCAAGCCCTATGTCTTTCAGCA 1526 

Qy 1501 GCCCTGGCATGAGCAGTACCTGTTCGACGTTTTTGTGTTTCTGCTGGGGCTCACTCTGGG 1560 

1111111111111111111111 I r I I I I I I I I I I I I I I I I I I I M I I M I I I I I I I I I I 
Db 1527 GCCCTGGCATGAGCAGTACCTGCTCGACGTTTTTGTGTTTCTGCTGGGGCTCACTCTGGG 158,6 

Qy 1561 GACTCTATGGCTTTGTGGGAAGCTGCTGGGCATGGCTGTCTGGTGGCTGCGTGGGGCCAG 1620 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I 
Db 1587 GACTCTATGGCTTTGTGGGAAGCTGCTGGGCATGGCTGTCTGGTGGCTGCGTGGGGCCAG 1646 

Qy 1621 AAAGGTGAAGGAGACATAAGGCCAGGTGCAGCCTTGGCGGGGTCTGTTTGGTGGGCGATG 1680 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1647 AAAGGTGAAGGAGACATAAGGCCAGGTGCAGCCTTGGCGGGGTCTGTTTGGTGGGCGATG 1706 

Qy 1681 TCACCATTTCTAGGGAGCTTCCCACTAGTTCTGGCAGCCCCATTCTCTAGTCCTTCTAGT 1740 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1707 TCACCATTTCTAGGGAGCTTCCCACTAGTTCTGGCAGCCCCATTCTCTAGTCCTTCTAGT 1766 

Qy . 1741 TATCTCCTGTTTTCTTGAAGAACAGGAAAAATGGCCAAAAATCATCCTTTCCACTTGCTA 1800 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1767 TATCTCCTGTTTTCTTGAAGAACAGGAAAAATGGCCAAAAATCATCCTTTCCACTTGCTA 1826 

Qy 1801 ATTTTGCTACAAATTCATCCTTACTAGCTCCTGCCTGCTAGCAGAAATCTTTCCAGTCCT 1860 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1827 ATTTTGCTACAAATTCATCCTTACTAGCTCCTGCCTGCTAGCAGAATTCTTTCCAGTCCT 1886 

Qy 1861 CTTGTCCTCCTTTGTTTGCCATCAGCAAGGGCTATGCTGTGATTCTGTCTCTGAGTGACT 1920 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I 
Db 1887 CTTGTCCTCCTTTGTTTGCCATCAGCAAGGGCTATGCTGTGATTCTGTCTCTGAGTGACT 1946 



Qy 1921 TGGACCACTGACCCTCAGATTTCCAGCCTTAAAATCCACCTTCCTTCTCATGCGCCTCTC 1980 

M I i I I I I I I I I I I I I I I I 11 I I i I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I 
Db 1947 TGGACCACTGACCCTCAGATTTCCAGCCTTAAAATCCACCTTCCTTCTCATGCGCCTCTC 2006 

Qy 1981 CGAATCACACCCTGACTCTTCCAGCCTCCATGTCCAGACCTAGTCAGCCTCTCTCACTCC 2040 

I I M I I I I I I I I I I I I M I I I I I I I I I I I I I I M I I M I I I I I I I I I I I I I I M I I I I I I 
Db 2007 CGAATCACACCCTGACTCTTCCAGCCTCCATGTCCAGACCTAGTCAGCCTCTCTCACTCC 2066 

Qy 2041 TGCCCCTACTATCTATCATGGAATAACATCCAAGAAAGACACCTTGCATATTCTTTCAGT 2100 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2067 TGCCCCT ACT ATCTATCATGGAATAACATCCAAGAAAGACACCTTGCATATTCTTTCAGT 2126 

Qy 2101 TTCTGTTTTGTTCTCCCACATATTCTCTTCAATGCTCAGGAAGCCTGCCCTGTGCTTGAG 2160 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2127 TTCTGTTTTGTTCTCCCACATATTCTCTTCAATGCTCAGGAAGCCTGCCCTGTGCTTGAG 2186 

Qy 2161 AGTTCAGGGCCGGACACAGGCTCACAGGTCTCCACATTGGGTCCCTGTCTCTGGTGCCCA 2220 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2187 AGTTCAGGGCCGGACACAGGCTCACAGGTCTCCACATTGGGTCCCTGTCTCTGGTGCCCA 2246 

Qy 2221 CAGTGAGCTCCTTCTTGGCTGAGCAGGCATGGAGACTGTAGGTTTCCAGATTTCCTGAAA 2280 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 n I 

Db 2247 CAGTGAGCTCCTTCTTGGCTGAGCAGGCATGGAGACTGTAGGTTTCCAGATTTCCTGAAA 2306 

Qy 2281 AATAAAAGTTTACAGCGTTATCTCTCCCCAACCTC 2315 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2307 AATAAAAGTTTACAGCGTTATCTCTCCCCAACCTC 2341 
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DEFINITION 
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AUTHORS 



TITLE 
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REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 



AK075383 ,2341 bp mRNA linear PRI 03-SEP-2002 

Homo sapiens cDNA PSEC0073 fis, clone NT2RP2002934 , weakly similar 
to UDP-GLUCURONOSYLTRANSFERASE 2C1 MICROSOMAL (EC 2.4.1.17). 
AK075383 

AK075383,1 GI: 22761433 

oligo capping; fis (full insert sequence) . 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 

Ota,T.^ Nishikawa^T. , Suzuki^Y., Kawai-Hio, Y. , Hayashi,K., 

Ishii^S.^ Saito,K., Yamamoto,J.^ Wakamatsu, A. , Nagai,T., 

Nakamura,Y., Nagahari^K., Sugano,S. and Isogai^T. 

HRI human cDNA- sequencing project > 

Unpublished 

2 (bases 1 to 2341) 

Isogai^T. and Yamamoto,J. 

Direct Submission 

Submitted (20-MAR-2002 ) Takao Isogai, Helix Research Institute^ 
Genomics Laboratory; 1532-3 Yana, Kisarazu, Chiba 292-0812^ Japan 
(E-mail: genomics Shri. CO. jp, Tel : 81-438-52-3975, Fax: 81-438-52-3986) 
HRI hiiman cDNA sequencing project; cDNA 5'- & 3 '-end one pass 
sequencing, clone selection and full insert sequencing: Helix 
Research Institute (supported by Japan Key Technology Center etc.); 
cDNA library construction: Institute of Medical Science, University 



of Tokyo, Laboratory of Genome Structure, Human Genome Center. 
FEATURES Location/Qualifiers 
source 1. .2341 

. /organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon: 9606" 
/ cl one= "NT2 RP2 0 0 2 9 3 4 " 
/cell_line="NT2" 
/ cell_type="teratocarcinoma" 
/clone_lib="NT2RP2 " 

/note="cloning vector: pME18SFL3-mRNA from NT2 neuronal 
precursor cells after 2-weeks retinoic acid (RA) 
induction" 
CDS 94. .1665 

/note="unnamed protein product" 
/codon_start=l 
/protein_id="BAC11583.1" 
/db_xref="GI: 22761434" 

/translation="MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQ 
ILQDHGHNVTMLNHKRGPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLG 
.GRGKFENLLNVLEYLALQCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGK 
PFVAILSTSFGSLEFGLPIPLSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQ 
STFDNTIKEHFTEGSRPVLSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPI 
KPVPQDLENFIAKFEDSGFVLVTLGSMVNTCQNPEIFKEMNNAFAHLPQGVIWKCQCS 
HWPKDVHLAANVKIVI)WLPQSDLLAHPSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFG 
DQPENMVRVE7SJ<KFGVSIQLKKLKAETLALKMKQIMEDKRYKSAAV;^ 
PTQRLVGWIDHVLQTGGATHLKPYVFQQPWHEQYLLDVFVFLLGLTLGTLWLCGKLLG 
MAVWWLRGARKVKET " 

ORIGIN 



Query Match 99.6%; Score 2310.2; DB 9; Length 2341; 

Best Local Similarity 99.9%; Pred. No. 0; 

Matches 2312; Conservative 0; Mismatches 3; Indels 0; Gaps 0; 



Qy 


1 


AGGGTCCCTTAGCCGGGCGCAGGGCGCGCAGCCCAGGCTGAGATCCGCGGCTTCCGTAGA 


60 






1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


27 


AGGGTCCCTTAGCCGGGCGCAGGGCGCGCAGCCCAGGCTGAGATCCGCGGCTTCCGTAGA 


86 


Qy 


61 


AGTGAGCATGGCTGGGCAGCGAGTGCTTCTTCTAGTGGGCTTCCTTCTCCCTGGGGTCCT 


120 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


87 


AGTGAGCATGGCTGGGCAGCGAGTGCTTCTTCTAGTGGGCTTCCTTCTCCCTGGGGTCCT 


146 


Qy 


121 


GCTCTCAGAGGCTGCCAAAATCCTGACAATATCTACAGTAGGTGGAAGCCATTATCTACT 


180 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


147 


GCTCTCAGAGGCTGCCAAAATCCTGACAATATCTACAGTAGGTGGAAGCCATTATCTACT 


206 


Qy 


181 


GATGGACCGGGTTTCTCAGATTCTTCAAGATCACGGTCATAATGTCACCATGCTTAACCA 


240 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


207 


GATGGACCGGGTTTCTCAGATTCTTCAAGATCACGGTCATAATGTCACCATGCTTAACCA 


266 


Qy 


241 


CAAAAGAGGTCCTTTTATGCCAGATTTTAAAAAGGAAGAA7VAATCATATCAAGTTATCAG 


300 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


267 


CAAAAGAGGTCCTTTTATGCCAGATTTTAAAAAGGAAGAAAAATCATATCAAGTTATCAG 


326 


Qy 


301 


TTGGCTTGCACCTGAAGATCATCAAAGAGAATTTAAAAAGAGTTTTGATTTCTTTCTGGA 


360 






1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 M 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 




Db 


327 


TTGGCTTGCACCTGAAGATCATCAAAGAGAATTT7VAAAAGAGTTTTGATTTCTTTCTGGA 


386 



Qy 361 AGAAACTTTAGGTGGCAGAGGAAAATTTGAAAACTTATTAAATGTTCTAGAATACTTGGC 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 387 AGAAACTTTAGGTGGCAGAGGAAAATTTGAAAACTTATTAAATGTTCTAGAATACTTGGC 446 

Qy 421 GTTGCAGTGCAGTCATTTTTTAAATAGAAAGGATATCATGGATTCCTTAAAGAATGAGAA 480 

I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I i I I I I I I I I I I I I I I I I I 

Db 447 GTTGCAGTGCAGTCATTTTTTAAATAGAAAGGATATCATGGATTCCTTAAAGAATGAGAA 506 

Qy 481 CTTCGACATGGTGATAGTTGAAACTTTTGACTACTGTCCTTTCCTGATTGCTGAGAAGCT 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I 
Db 507 CTTCGACATGGTGATAGTTGAAACTTTTGACTACTGTCCTTTCCTGATTGCTGAGAAGCT 566 

Qy 541 TGGGAAGCCATTTGTGGCCATTCTTTCCACTTCATTCGGCTCTTTGGAATTTGGGCTACC 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 567 TGGGAAGCCATTTGTGGCCATTCTTTCCACTTCATTCGGCTCTTTGGAATTTGGGCTACC 626 

Qy 601 AATCCCCTTGTCTTATGTTCCAGTATTCCGTTCCTTGCTGACTGATCACATGGACTTCTG 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 627 AATCCCCTTGTCTTATGTTCCAGTATTCCGTTCCTTGCTGACTGATCACATGGACTTCTG 686 

Qy 661 GGGCCGAGTGAAGAATTTTCTGATGTTCTTTAGTTTCTGCAGGAGGCAACAGCACATGCA 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 11 I I I M I I I I I I I I I I 
Db 687 GGGCCGAGTGAAGAATTTTCTGATGTTCTTTAGTTTCTGCAGGAGGCAACAGCACATGCA 746 

Qy 721 GTCTACATTTGACAACACCATCAAGGAACATTTCACAGAAGGCTCTAGGCCAGTTTTGTC 780 

I I I I I I I I I I I I I I I I I I I I I I I I U I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 747 GTCTACATTTGACAACACCATCAAGGAACATTTCACAGAAGGCTCTAGGCCAGTTTTGTC 806 

Qy 781 TCATCTTCTACTGAAAGCAGAGTTGTGGTTCATTAACTCTGACTTTGCCTTTGATTTTGC 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I 
Db 807 TCATCTTCTACTGAAAGCAGAGTTGTGGTTCATTAACTCTGACTTTGCCTTTGATTTTGC 866 

Qy 841 TCGACCTCTGCTTCCCAACACTGTTTATGTTGGAGGCTTGATGGAAAAACCTATTAAACC 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 867 TCGACCTCTGCTTCCCAACACTGTTTATGTTGGAGGCTTGATGGAAAAACCTATTAAACC 926 

Qy 901 AGTACCACAAGACTTGGAGAACTTCATTGCCAAGTTTGGGGACTCTGGTTTTGTCCTTGT 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I 
Db 927 AGTACCACAAGACTTGGAGAACTTCATTGCCAAGTTTGAGGACTCTGGTTTTGTCCTTGT 986 

Qy 961 GACCTTGGGCTCCATGGTGAACACCTGTCAGAATCCGGAAATCTTCAAGGAGATGAACAA 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 987 GACCTTGGGCTCCATGGTGAACACCTGTCAGAATCCGGAAATCTTCAAGGAGATGAACAA 1046 

Qy 1021 TGCCTTTGCTCACCTACCCCAAGGGGTGATATGGAAGTGTCAGTGTTCTCATTGGCCCAA 1080 

I I I I I I I I I I I I I I I I I I I I I I I I i I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1047 TGCCTTTGCTCACCTACCCCAAGGGGTGATATGGAAGTGTCAGTGTTCTCATTGGCCCAA 1106 

Qy 1081 AGATGTCCACCTGGCTGCAAATGTGAAAATTGTGGACTGGCTTCCTCAGAGTGACCTCCT 1140 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M i I 
Db 1107 AGATGTCCACCTGGCTGCAAATGTGAAAATTGTGGACTGGCTTCCTCAGAGTGACCTCCT 1166 

Qy 1141 GGCTCACCCAAGCATCCGTCTGTTTGTCACCCACGGCGGGCAGjDLATAGCATAATGGAGGC 1200 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I 
Db 1167 GGCTCACCCAAGCATCCGTCTGTTTGTCACCCACGGCGGGCAGAATAGCATAATGGAGGC 1226 



Qy 1201 CATCCAGCATGGTGTGCCCATGGTGGGGATCCCTCTCTTTGGAGACCAGCCTGAAAACAT 1260 

I I I I I I I I I I I I I M I I I I 1 I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1227 CATCCAGCATGGTGTGCCCATGGTGGGGATCCCTCTCTTTGGAGACCAGCCTGAAAACAT 1286 

Qy 1261 GGTCCGAGTAGAAGCCAAAAAGTTTGGTGTTTCTATTCAGTTAAAGAAGCTC7\AGGCAGA 1320 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I M I I I I I I I I I I I I I I 
Db 1287 GGTCCGAGTAGAAGCCAAAAAGTTTGGTGTTTCTATTCAGTTAAAGAAGCTCAAGGCAGA 1346 

Qy 1321 GACATTGGCTCTTAAGATGAAACAAATCATGGAAGACAAGAGATACAAGTCCGCGGCAGT 1380 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I J I I I I I I I I I I I I 
Db 1347 GACATTGGCTCTTAAGATGAAACAAATCATGGAAGACAAGAGATACAAGTCCGCGGCAGT 1406 

Qy 1381 GGCTGCCAGTGTCATCCTGCGCTCCCACCCGCTCAGCCCCACACAGCGGCTGGTGGGCTG 1440 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 14 07 GGCTGCCAGTGTCATCCTGCGCTCCCACCCGCTCAGCCCCACACAGCGGCTGGTGGGCTG 14 66 

Qy 1441 GATTGACCACGTCCTCCAGACAGGGGGCGCGACGCACCTCAAGCCCTATGTCTTTCAGCA 1500 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 14 67 GATTGACCACGTCCTCCAGACAGGGGGCGCGACGCACCTCAAGCCCTATGTCTTTCAGCA 1526 

Qy 1501 GCCCTGGCATGAGCAGTACCTGTTCGACGTTTTTGTGTTTCTGCTGGGGCTCACTCTGGG 1560 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1527 GCCCTGGCATGAGCAGTACCTGCTCGACGTTTTTGTGTTTCTGCTGGGGCTCACTCTGGG 1586 

Qy 1561 GACTCTATGGCTTTGTGGGAAGCTGCTGGGCATGGCTGTCTGGTGGCTGCGTGGGGCCAG 1620 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1587 GACTCTATGGCTTTGTGGGAAGCTGCTGGGCATGGCTGTCTGGTGGCTGCGTGGGGCCAG 1646 

Qy 1621 AAAGGTGAAGGAGACATAAGGCCAGGTGCAGCCTTGGCGGGGTCTGTTTGGTGGGCGATG 1680 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1647 AAAGGTGAAGGAGACATAAGGCCAGGTGCAGCCTTGGCGGGGTCTGTTTGGTGGGCGATG 1706 

Qy 1681 TCACCATTTCTAGGGAGCTTCCCACTAGTTCTGGCAGCCCCATTCTCTAGTCCTTCTAGT 1740 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1707 TCACCATTTCTAGGGAGCTTCCCACTAGTTCTGGCAGCCCCATTCTCTAGTCCTTCTAGT 1766 

Qy 1741 TATCTCCTGTTTTCTTGAAGAACAGGAAAAATGGCCAAAAATCATCCTTTCCACTTGCTA 1800 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1767 TATCTCCTGTTTTCTTGAAGAACAGGAAAAATGGCCAAAAATCATCCTTTCCACTTGCTA 1826 

Qy 1801 ATTTTGCTACAAATTCATCCTTACTAGCTCCTGCCTGCTAGCAGA/^ATCTTTCCAGTCCT 1860 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1827 ATTTTGCTACAAATTCATCCTTACTAGCTCCTGCCTGCTAGCAGAATTCTTTCCAGTCCT 1886 

Qy 1861 CTTGTCCTCCTTTGTTTGCCATCAGCAAGGGCTATGCTGTGATTCTGTCTCTGAGTGACT 1920 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I.UUM I I I I I I I I I I I I I I I I 
Db 1887 CTTGTCCTCCTTTGTTTGCCATCAGCAAGGGCTATGCTGTGATTCTGTCTCTGAGTGACT 1946 

Qy 1921 TGGACCACTGACCCTCAGATTTCCAGCCTTAAAATCCACCTTCCTTCTCATGCGCCTCTC 1980 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db ^ 1947 TGGACCACTGACCCTCAGATTTCCAGCCTTAAAATCCACCTTCCTTCTCATGCGCCTCTC 2006 

Qy 1981 CGAATCACACCCTGACTCTTCCAGCCTCCATGTCCAGACCTAGTCAGCCTCTCTCACTCC 2040 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2007 CGAATCACACCCTGACTCTTCCAGCCTCCATGTCCAGACCTAGTCAGCCTCTCTCACTCC 2066 

Qy 2041 TGCCCCTACTATCTATCATGGAATAACATCCAAGAAAGACACCTTGCATATTCTTTCAGT 2100 



Db 



2067 



2126 



Qy 2101 TTCTGTTTTGTTCTCCCACATATTCTCTTCAATGCTCAGGAAGCCTGCCCTGTGCTTGAG 2160 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I 
Db 2127 TTCTGTTTTGTTCTCCCACATATTCTCTTCAATGCTCAGGAAGCCTGCCCTGTGCTTGAG 2186 

Qy 2161 AGTTCAGGGCCGGACACAGGCTCACAGGTCTCCACATTGGGTCCCTGTCTCTGGTGCCCA 2220 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I 
Db 2187 AGTTCAGGGCCGGACACAGGCTCACAGGTCTCCACATTGGGTCCCTGTCTCTGGTGCCCA 2246 

Qy 2221 CAGTGAGCTCCTTCTTGGCTGAGCAGGCATGGAGACTGTAGGTTTCGAGATTTCCTGAAA 2280 

I I I I I I M I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2247 CAGTGAGCTCCTTCTTGGCTGAGCAGGCATGGAGACTGTAGGTTTCCAGATTTCCTGAAA 2306 

Qy 2281 AATAAAAGTTTACAGCGTTATCTCTCCCCAACCTC 2315 

I I I I I I I I I i I I I I I 1 I I I I I I I I I I I I I I I I I I I 
Db 2307 AATAAAAGTTTACAGCGTTATCTCTCCCCAACCTC 2341 



RESULT 6 
AX548037 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 
JOURNAL 

FEATURES 

source 



ORIGIN 



AX548037 2944 bp DNA 

Sequence 13 from Patent WO02066654. 

AX548037 

AX548037.1 GI: 25813133 



(human) 



linear PAT 26-NOV-2002 



Homo sapiens 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 

Astromof f , A. , Au- Young, J., Baughn,M.R., Ding^L., Duggan,B.M., 
Forsythe, I. J. , Gietzen, K. J. , Griff in, J. A. , Lee,E.A., Lu,Y., 
Richardson, T.W. , Ring,H. Z . , Sanjanwala,M.M. , Swarnakar, A. , 
Walia,N.K., Warren, B. A., Xu,Y., Yue,H. and Zebarjadian,Y. 

Drug metabolizing enzymes 

Patent: WO 02066654-A 13 29-AUG-2002; 

Incyte Genomics, Inc. (US) 

Location/Qualifiers 

1. .2944 

/organism="Homo sapiens" 
/mol_type="unassigned DNA" 
/db_xref="taxon: 9606" 
/note="Incyte ID No: 7486594CB1" 



Query Match 90.6%; 
Best Local Similarity 95.5%; 
Matches 2216; Conservative 



Score 2102.8; 
Pred. No. 0; 
0; Mismatches 



DB 6; Length 2944; 

2; Indels 102; Gaps 



1; 



Qy 

Db 



204 



AGGGTCCCTTAGCCGGGCGCAGGGCGCGCAGCCCAGGCTGAGATCCGCGGCTTCCGTAGA 60 
I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I 
AGGGTCCCTTAGCCGGGCGCAGGGCGCGCAGCCCAGGCTGAGATCCGCGGCTTCCGTAGA 2 63 



Qy 



61 AGTGAGCATGGCTGGGCAGCGAGTGCTTCTTCTAGTGGGCTTCCTTCTCCCTGGGGTCCT 
I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



120 



Db 264 AGTGAGCATGGCTGGGCAGCGAGTGCTTCTTCTAGTGGGCTTCCTTCTCCCTGGGGTCCT 323 

Qy 121 GCTCTCAGAGGCTGCCAAAATCCTGACAATATCTACAGTAGGTGGAAGCCATTATCTACT 180 

H I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I 
Db 324 GCTCTCAGAGGCTGCCAAAATCCTGACAATATCTACAGT 362 

Qy 181 GATGGACCGGGTTTCTCAGATTCTTCAAGATCACGGTCATAATGTCACCATGCTTAACCA 240 

Db 363 362 

Qy 241 CAAAAGAGGTCCTTTTATGCCAGATTTTAAAAAGGAAGAAAAATCATATCAAGTTATCAG 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 363 AGATTTTAAAAAGGAAGAAAAATCATATCAAGTTATCAG 4 01 

Qy 301 TTGGCTTGCACCTGAAGATCATCAAAGAGAATTTAAAAAGAGTTTTGATTTCTTTCTGGA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I 
Db 402 TTGGCTTGCACCTGAAGATCATCAAAGAGAATTTAAAAAGAGTTTTGATTTCTTTCTGGA 461 

Qy 361 AGAAACTTTAGGTGGCAGAGGAAAATTTGAAAACTTATTAAATGTTCTAGAATACTTGGC 420 

I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I M I I I I I I M I I I I I I I I I I 

Db 462 AGAAACTTTAGGTGGCAGAGGAAAATTTGAAAACTTATTAAATGTTCTAGAATACTTGGC 521 

Qy 421 GTTGCAGTGCAGTCATTTTTTAAATAGAAAGGATATCATGGATTCCTTAAAGAATGAGAA 480 

I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 522 GTTGCAGTGCAGTCATTTTTTT^AATAGAAAGGATATCATGGATTCCTTAAAGAATGAGAA 581 

Qy , 481 CTTCGACATGGTGATAGTTGAAACTTTTGACTACTGTCCTTTCCTGATTGCTGAGAAGCT 540 

I I I I M I I I I I i I I I I I I I I I M I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 

Db 582 CTTCGACATGGTGATAGTTGAAACTTTTGACTACTGTCCTTTCCTGATTGCTGAGAAGCT 641 

Qy 541 TGGGAAGCCATTTGTGGCCATTCTTTCCACTTCATTCGGCTCTTTGGAATTTGGGCTACC 600 

I I I I I I I I I I I I I I I I M I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 11 I I I I I I 
Db 642 TGGGAAGCCATTTGTGGCCATTCTTTCCACTTCATTCGGCTCTTTGGAATTTGGGCTACC 701 

Qy 601 AATCCCCTTGTCTTATGTTCCAGTATTCCGTTCCTTGCTGACTGATCACATGGACTTCTG 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 

Db 702 AATCCCCTTGTCTTATGTTCCAGTATTCCGTTCCTTGCTGACTGATCACATGGACTTCTG 761 

Qy 661 GGGCCGAGTGAAGAATTTTCTGATGTTCTTTAGTTTCTGCAGGAGGCAACAGCACATGCA 720 

I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 762 GGGCCGAGTGAAGAATTTTCTGATGTTCTTTAGTTTCTGCAGGAGGCAACAGCACATGCA 821 

Qy 721 GTCTACATTTGACAACACCATCAAGGAACATTTCACAGAAGGCTCTAGGCCAGTTTTGTC 780 

I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 

Db 822 GTCTACATTTGACAACACCATCAAGGAACATTTCACAGT^GGCTCTAGGCCAGTTTTGTC 881 

Qy 781 TCATCTTCTACTGAAAGCAGAGTTGTGGTTCATTAACTCTGACTTTGCCTTTGATTTTGC 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I M M I I 
Db 882 TCATCTTCTACTGAAAGCAGAGTTGTGGTTCATTAACTCTGACTTTGCCTTTGATTTTGC 941 

Qy . 841 TCGACCTCTGCTTCCCAACACTGTTTATGTTGGAGGCTTGATGGAAAAACCTATTAAACC 900 

I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 942 TCGACCTCTGCTTCCCAACACTGTTTATGTTGGAGGCTTGATGGAAAAACCTATTAAACC 1001 

Qy 901 AGTACCACAAGACTTGGAGAACTTCATTGCCAAGTTTGGGGACTCTGGTTTTGTCCTTGT 960 

I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1002 AGTACCACAAGACTTGGAGAACTTCATTGCCAAGTTTGGGGACTCTGGTTTTGTCCTTGT 1061 



Qy 961 GACCTTGGGCTCCATGGTGAACACCTGTCAGAATCCGGAAATCTTCAAGGAGATGAACAA 1020 

I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1062 GACCTTGGGCTCCATGGTGAACACCTGTCAGAATCCGGAAATCTTCAAGGAGATGAACAA 1121 

Qy 1021 TGCCTTTGCTCACCTACCCCAAGGGGTGATATGGAAGTGTCAGTGTTCTCATTGGCCCAA 1080 

I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I 
Db 1122 TGCCTTTGCTCACCTACCCCAAGGGGTGATATGGAAGTGTCAGTGTTCTCATTGGCCCAA 1181 

Qy 1081 AGATGTCCACCTGGCTGCAAATGTGAAAATTGTGGACTGGCTTCCTCAGAGTGACCTCCT 1140 

I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1182 AGATGTCCACCTGGCTGCAAATGTGAAAATTGTGGACTGGCTTCCTCAGAGTGACCTCCT 1241 

Qy 1141 GGCTCACCCAAGCATCCGTCTGTTTGTCACCCACGGCGGGCAGAATAGCATAATGGAGGC 1200 

I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I 
Db 1242 GGCTCACCCAAGCATCCGTCTGTTTGTCACCCACGGCGGGCAGAATAGCATAATGGAGGC 1301 

Qy 1201 CATCCAGCATGGTGTGCCCATGGTGGGGATCCCTCTCTTTGGAGACCAGCCTGAAAACAT 1260 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I 
Db 1302 CATCCAGCATGGTGTGCCCATGGTGGGGATCCCTCTCTTTGGAGACCAGCCTGAAAACAT 1361 

Qy 1261 GGTCCGAGTAGAAGCCAAAAAGTTTGGTGTTTCTATTCAGTTAAAGAAGCTCAAGGCAGA 1320 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I 
Db 1362 GGTCCGAGTAGAAGCCAAAAAGTTTGGTGTTTCTATTCAGTTAAAGAAGCTCAAGGCAGA 1421 

Qy 1321 GACATTGGCTCTTAAGATGAAACAAATCATGGAAGACAAGAGATACAAGTCCGCGGCAGT 1380 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I U I I I I I I 
Db 1422 GACATTGGCTCTTAAGATGAAACAAATCATGGAAGACAAGAGATACAAGTCCGCGGCAGT 1481 

Qy 1381 GGCTGCCAGTGTCATCCTGCGCTCCCACCCGCTCAGCCCCACACAGCGGCTGGTGGGCTG 1440 

I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1482 GGCTGCCAGTGTCATCCTGCGCTCCCACCCGCTCAGCCCCACACAGCGGCTGGTGGGCTG 1541 

Qy 1441 GATTGACCACGTCCTCCAGACAGGGGGCGCGACGCACCTCAAGCCCTATGTCTTTCAGCA 1500 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I 

Db 1542 GATTGACCACGTCCTCCAGACAGGGGGCGCGACGCACCTCAAGCCCTATGTCTTTCAGCA 1601 

Qy 1501 GCCCTGGCATGAGCAGTACCTGTTCGACGTTTTTGTGTTTCTGCTGGGGCTCACTCTGGG 1560 

II II I II I II II II II II II II I I I I I I I I I I II I I I I I I I I II II I I I I I I I I I I I I I 

Db 1602 GCCCTGGCATGAGCAGTACCTGCTCGACGTTTTTGTGTTTCTGCTGGGGCTCACTCTGGG 1661 

Qy 1561 GACTCTATGGCTTTGTGGGAAGCTGCTGGGCATGGCTGTCTGGTGGCTGCGTGGGGCCAG 1620 

I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II II II I I I I 
Db 1662 GACTCTATGGCTTTGTGGGAAGCTGCTGGGCATGGCTGTCTGGTGGCTGCGTGGGGCCAG 1721 

Qy 1621 AAAGGTG7\AGGAGACATAAGGCCAGGTGCAGCCTTGGCGGGGTCTGTTTGGTGGGCGATG 1680 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1722 AAAGGTGAAGGAGACATAAGGCCAGGTGCAGCCTTGGCGGGGTCTGTTTGGTGGGCGATG 1781 

Qy 1681 TCACCATTTCTAGGGAGCTTCCCACTAGTTCTGGCAGCCCCATTCTCTAGTCCTTCTAGT 1740 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I 
Db 1782 TCACCATTTCTAGGGAGCTTCCCACTAGTTCTGGCAGCCCCATTCTCTAGTCCTTCTAGT 1841 

Qy 1741 TATCTCCTGTTTTCTTGAAGAACAGGAAAAATGGCCAAAAATCATCCTTTCCACTTGCTA 1800 

I I I I I I I I I II I I M II I I I I I I I II II I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I 
Db 1842 TATCTCCTGTTTTCTTGAAGAACAGGAAAAATGGCCAAAAATCATCCTTTCCACTTGCTA 1901 



Qy 1801 ATTTTGCTACAAATTCATCCTTACTAGCTCCTGCCTGCTAGCAGAAATCTTTCCAGTCCT 1860 

I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I II M I I I 

Db 1902 ATTTTGCTACAAATTCATCCTTACTAGCTCCTGCCTGCTAGCAGAATTCTTTCCAGTCCT 1961 

Qy 1861 CTTGTCCTCCTTTGTTTGCCATCAGCAAGGGCTATGCTGTGATTCTGTCTCTGAGTGACT 1920 

II II II II II I M II I II I I I I I II I II II II I II II I I I II II II I I II I II I I II II I 

Db 1962 CTTGTCCTCCTTTGTTTGCCATCAGCAAGGGCTATGCTGTGATTCTGTCTCTGAGTGACT 2021 

Qy 1921 TGGACCACTGACCCTCAGATTTCCAGCCTTAAAATCCACCTTCCTTCTCATGCGCCTCTC 1980 

II I I I I II I I I I I I I I I I I I I I I I I II i I II I I I I I I I I I II I I I I II I I I II II I II I I 
Db 2022 TGGACCACTGACCCTCAGATTTCCAGCCTTA/WSlTCCACCTTCCTTCTCATGCGCCTCTC 2081 

Qy 1981 CGAATCACACCCTGACTCTTCCAGCCTCCATGTCCAGACCTAGTCAGCCTCTCTCACTCC 2040 

I I I I I I I I I I II I I I I I I I II II I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I II I I I 
Db 2082 CGAATCACACCCTGACTCTTCCAGCCTCCATGTCCAGACCTAGTCAGCCTCTCTCACTCC 2141 

Qy 2041 TGCCCCTACTATCTATCATGGAATAACATCCAAGAAAGACACCTTGCATATTCTTTCAGT 2100 

II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 

Db 2142 TGCCCCTACTATCTATCATGGAATAACATCCAAGAAAGACACCTTGCATATTCTTTCAGT 2201 

Qy 2101 TTCTGTTTTGTTCTCCCACATATTCTCTTCAATGCTCAGGAAGCCTGCCCTGTGCTTGAG 2160 

I I I I I I I I II I II I I I I II I M I II I I I I I I I I I I I I I I M I I I I I I I I I II I II I II I I 
Db 2202 TTCTGTTTTGTTCTCCCACATATTCTCTTCAATGCTCAGGAAGCCTGCCCTGTGCTTGAG 2261 

Qy 2161 AGTTCAGGGCCGGACACAGGCTCACAGGTCTCCACATTGGGTCCCTGTCTCTGGTGCCCA 2220 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2262 AGTTCAGGGCCGGACACAGGCTCACAGGTCTCCACATTGGGTCCCTGTCTCTGGTGCCCA 2321 

Qy 2221 CAGTGAGCTCCTTCTTGGCTGAGCAGGCATGGAGACTGTAGGTTTCCAGATTTCCTGAAA 2280 

I I I I I I I II I I I II I I I I I I I I II I I I I I I I I I I I I I I I MM I I II I I I I I I II I I I I I 
Db 2322 CAGTGAGCTCCTTCTTGGCTGAGCAGGCATGGAGACTGTAGGTTTCCAGATTTCCTGAAA 2381 

Qy 2281 AATA7WVGTTTACAGCGTTATCTCTCCCC/JVCCTCACTAA 2320 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 2382 AATAAAAGTTTACAGCGTTATCTCTCCCCAACCTCACTAA 2421 
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ORIGIN 



/db xref="taxon:9606" 



Query Match 68.1%; Score 1578.8; DB 6; Length 2263; 

Best Local Similarity 92.6%; Pred. No. 0; 

Matches 1658; Conservative 0; Mismatches 132; Indels 0; Gaps 0; 

Qy 7 CCTTAGCCGGGCGCAGGGCGCGCAGCCCAGGCTGAGATCCGCGGCTTCCGTAGAAGTGAG 66 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I 
Db 22 CCTTAGCCCGGCGCAGGGCGCGCAGCCCAGGCTGAGATCCGCTGCTTCTGTGGAAGTGAG 81 

Qy 67 CATGGCTGGGCAGCGAGTGCTTCTTCTAGTGGGCTTCCTTCTCCCTGGGGTCCTGCTCTC 126 

I I I I I I I I M I I I I I I I I I I 1 I I i I I I I I I I I I M I M I I I I I I I I I I I I I I I I 
Db 82 CATGGTTGGGCAGCGGGTGCTGCTTCTAGTGGCCTTCCTTCTTTCTGGGGTCCTGCTCTC 141 

Qy 127 AGAGGCTGCCAAAATCCTGACAATATCTACAGTAGGTGGAAGCCATTATCTACTGATGGA 186 

I I I I I I I I I I I I M I M I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I MM 

Db 142 AGAGGCTGCCAAAATCCTGACAATATCTACACTGGGTGGAAGCCATTACCTACTGTTGGA 201 

Qy 187 CCGGGTTTCTCAGATTCTTCAAGATCACGGTCATAATGTCACCATGCTTAACCACAAAAG 246 

M M M NM M M M M M M M M M M M M M I M M M M I M I I 
Db 202 CCGGGTGTCTCAGATTCTTCAAGAGCATGGTCATAATGTGACTATGCTTCATCAGAGTGG 261 

Qy 247 AGGTCCTTTTATGCCAGATTTTAAAAAGGAAGAAAAATCATATCAAGTTATCAGTTGGCT 306 

I Ml M M I M I M M I MM M M M M M I M M M M M I Ml I 

Db 262 AAAGTTTTTGATCCCAGATATTAAAGAGGAGGAAAAATCATACCAAGTTATCAGGTGGTT 321 

Qy 307 TGCACCTGAAGATCATCAAAGAGAATTTAAAAAGAGTTTTGATTTCTTTCTGGAAGAAAC 366 

I M M M M II M M I M I I I I II I I I I I II I I II II I I I I II 

Db 322 TTCACCTGAAGATCATCAAAAAAGAATTAAGAAGCATTTTGATAGCTACATAGAAACAGC 381 

Qy 367 TTTAGGTGGCAGAGGAAAATTTGAAAACTT ATTAAATGTTCTAGAATACTTGGCGTTGCA 426 

M I M M I I I I I II I II I II II II I I I M Mil II 

Db 382 ATTGGATGGCAGAAAAGAATCTGAAGCCCTTGTAAAGCTAATGGAAATATTTGGGACTCA 441 

Qy 427 GTGCAGTCATTTTTTAAATAGAAAGGATATCATGGATTCCTTAAAGAATGAGAACTTCGA 486 

II III I II I III II I I I II II II II I II I I I I II II I I II I I I II II I II 
Db 442 ATGTAGTTATTTGCTAAGCAGAAAGGATATAATGGATTCCTTA7\AGAATGAGAACTATGA 501 

Qy 487 CATGGTGATAGTTGAAACTTTTGACTACTGTCCTTTCCTGATTGCTGAGAAGCTTGGGAA 546 

II II I I I M I I I Mill I I II I I I I M II I II II M M I I II I I I I III 
Db 502 TCTGGTATTTGTTGAAGCATTTGATTTCTGTTCTTTCCTGATTGCTGAGAAGCTTGTGAA 561 

Qy 547 GCCATTTGTGGCCATTCTTTCCACTTCATTCGGCTCTTTGGAATTTGGGCTACCAATCCC 606 

II I II II I I I I I II I II I MM I II I II I I I I I M I I I I II I II I 11 I I II III 
Db. ...... 562 ACCATTTGTGGCCATTCTTCCCACCACATTCGGCTCTTXGGATTTTGGGCTACCAAGCCC 621 

Qy 607 CTTGTCTTATGTTCCAGTATTCCGTTCCTTGCTGACTGATCACATGGACTTCTGGGGCCG 666 

II M I II I i I I I I II II I I I M I II I II II I I I I I I M II M II I I II I II I I II I I M 

Db 622 CTTGTCTTATGTTCCAGTATTCCCTTCCTTGCTGACTGATCACATGGACTTCTGGGGCCG 681 

Qy 667 AGTGAAGAATTTTCTGATGTTCTTTAGTTTCTGCAGGAGGCAACAGCACATGCAGTCTAC 726 

I I II II I I I I I I I M II I I II II II I II I I II I I II II III I II I I II I I I I I II 
Db 682 AGTGAAGAATTTTCTGATGTTCTTTAGTTTCTCCAGGAGCCAATGGGACATGCAGTCTAC 741 

Qy 727 ATTTGACAACACCATCAAGGAACATTTCACAGAAGGCTCTAGGCCAGTTTTGTCTCATCT 786 

M II II II I M I M II II I II II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 



Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

.Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



742 ATTTGACAACACCATCAAGGAGCATTTCCCAGAAGGCTCTAGGCCAGTTTTGTCTCATCT 801 



787 



802 



847 



TCTACTGAAAGCAGAGTTGTGGTTCATTAACTCTGACTTTGCCTTTGATTTTGCTCGACC 846 
I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II 

TCTACTGAAAGCAGAGTTGTGGTTTGTTAACTCTGATTTTGCCTTTGATTTTGCCCGGCC 861 



906 



TCTGCTTCCCAACACTGTTTATGTTGGAGGCTTGATGGAAAAACCTATTAAACCAGTACC 
II I II I M II M II II II I II II I M I I I I I I I I I I I I II II I II I II I II II II II I 
862 CCTGCTTCCCAACACTGTTTATATTGGAGGCTTGATGGAAAAACCTATTAAACCAGTACC 921 

907 ACAAGACTTGGAGAACTTCATTGCCAAGTTTGGGGACTCTGGTTTTGTCCTTGTGACCTT 966 

I II I I II II I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
922 ACAAGACTTGGACAACTTCATTGCCAACTTTGGGGATGCAGGGTTTGTCCTTGTGGCCTT 981 

967 GGGCTCCATGGTGAACACCTGTCAGAATCCGGAAATCTTCAAGGAGATGAACAATGCCTT 1026 

I I I I I I I I II I I I II I I II II II I I II I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I 
982 TGGCTCCATGGTGAACACCTGTCAGAATCCGGAAATCTTCAAGGAGATGAACAATGCCTT 1041 

1027 TGCTCACCTACCCCAAGGGGTGATATGGAAGTGTCAGTGTTCTCATTGGCCCAAAGATGT 1086 
I I I I I II I I M I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I II I I 

1042 TGCTCACCTACCCCAAGGGGTGATATGGAAGTGTCAGTGTTCTCATTGGCCCAAAGATGT 1101 

1087 CCACCTGGCTGCAAATGTGAAAATTGTGGACTGGCTTCCTCAGAGTGACCTCCTGGCTCA 1146 

I I I I I I I I I II M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I 
1102 CCACCTGGCTGCAAATGTGAAAATTGTGGACTGGCTTCCTCAGAGTGACCTCCTGGCTCA 1161 

1147 CCCAAGCATCCGTCTGTTTGTCACCCACGGCGGGCAGAATAGCATAATGGAGGCCATCCA 1206 

I II I I I I I I I I I I I I II I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
1162 CCCAAGCATCCGTCTGTTTGTCACCCACGGCGGGCAGAATAGCATAATGGAGGCCATCCA 1221 

1207 GCATGGTGTGCCCATGGTGGGGATCCCTCTCTTTGGAGACCAGCCTGAAAACATGGTCCG 1266 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1222 GCATGGTGTGCCCATGGTGGGGATCCCTCTCTTTGGAGACCAGCCTGA/^CATGGTCCG 1281 



1267 



1326 



AGTAGAAGCCAAAAAGTTTGGTGTTTCTATTCAGTTAAAGAAGCTCAAGGCAGAGACATT 
M I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1282 AGTAGAAGCCAAAAAGTTTGGTGTTTCTATTCAGTTAAAGAAGCTCAAGGCAGAGACATT 1341 

1327 GGCTCTTAAGATGAAACAAATCATGGAAGACAAGAGATACAAGTCCGCGGCAGTGGCTGC 1386 

I M I I I I I I I I I I I I I I I I I I I M I I I I I II I I I I I I I I I I M I I I I I I I I I I I I I I I I I 
1342 GGCTCTTAAGATGAAACAAATCATGGAAGACAAGAGATACAAGTCCGCGGCAGTGGCTGC 1401 

1387 CAGTGTCATCCTGCGCTCCCACCCGCTCAGCCCCACACAGCGGCTGGTGGGCTGGATTGA 1446 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I 
1402 CAGTGTCATCCTGCGCTCCCACCCGCTCAGCCCCACACAGCGGCTGGTGGGCTGGATTGA 1461 

1447 CCACGTCCTCCAGACAGGGGGCGCGACGCACCTCAAGCCCTATGTCTTTCAGCAGCCCTG 1506 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1462 CCACGTCCTCCAGACAGGGGGCGCGACGCACCTCAAGCCCTATGTCTTTCAGCAGCCCTG 1521 

1507 GCATGAGCAGTACCTGTTCGACGTTTTTGTGTTTCTGCTGGGGCTCACTCTGGGGACTCT 1566 
I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
GCATGAGCAGTACCTGCTCGACGTTTTTGTGTTTCTGCTGGGGCTCACTCTGGGGACTCT 1581 



1522 



1567 ATGGCTTTGTGGGAAGCTGCTGGGCATGGCTGTCTGGTGGCTGCGTGGGGCCAGAAAGGT 1626 

I I I I I I I I I I I I I M I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1582 ATGGCTTTGTGGGAAGCTGCTGGGCATGGCTGTCTGGTGGCTGCGTGGGGCCAGAAAGGT 1641 



Qy 1627 GAAGGAGACATAAGGCCAGGTGCAGCCTTGGCGGGGTCTGTTTGGTGGGCGATGTCACCA 1686 

I I I I M I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1642 GAAGGAGACATAAGGCCAGGTGCAGCCTTGGCGGGGTCTGTTTGGTGGGCGATGTCACCA 1701 



Qy 1687 TTTCTAGGGAGCTTCCCACTAGTTCTGGCAGCCCCATTCTCTAGTCCTTCTAGTTATCTC 1746 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1702 TTTCTAGGGAGCTTCCCACTAGTTCTGGCAGCCCCATTCTGTAGTCCTTCTAGTTATCTC 1761 

Qy 1747 CTGTTTTCTTGAAGAACAGGAAAAATGGCCAAAAATCATCCTTTCCACTT 1796 

I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 
Db 1762 CTGTTTTCTTGAAGAACAGGAAAAATGGCCAAAAATCATCCTTTCCACAT 1811 



RESULT 8 
AX359921 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNTUi 

FEATURES 

source 



CDS 



AX359921 2082 bp DNA linear PAT 13-FEB-2002 

Sequence 1 from Patent WO0202774. 

AX359921 

AX359921.1 GI: 18675563 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chorda ta; Craniata; Vertebrata; Euteleostomi; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 

Leiby^K.R., Cook, W.J. and Spaltmann,F. 

32626;, a human udp-glycosyltransf erase and uses thereof 
Patent: WO 0202774-A 1 lO-JiW-2002; 
Millennium Pharmaceuticals, Inc. (US) 

Location/Qualifiers 

1. .2082 

/organism="Homo sapiens" 
/mol_type="unassigned DNA" 
/db_xref="taxon: 9606" 
145. .1716 

/ not e= "unnamed protein product" 
/codon_start=l 
/protein_id="CAD23353 . 1" 
/db_xref="GI: 18675564" 

/translation="MVGQRVLLLVAFLLSGVLLSEAAKILTISTLGGSHYLLLDRVSQ 
ILQEHGHNVTMLHQSGKFLIPDIKEEEKSYQVIRWFSPEDHQKRIKKHFDSYIETALD 
. GRKESEALVKLMEIFGTQCSYLLSRKDIMDSLKNENYDLVFVEAFDFCSFLIAEKLVK 
PFVAILPTTFGSLDFGLPSPLSYVPVFPSLLTDHMDFWGRVKNFLMFFSFSRSQWDMQ 
STFDNTIKEHFPEGSRPVLSHLLLKAELWFVNSDFAFDFARPLLPNTVYIGGLMEKPI 
KPVPQDLDNFIANFGDAGFVLVAFGSMLNTHQSQEVLKKMHNAFAHLPQGVIWTCQSS 
HWPRDVHLATNVKIVDWLPQSDLLAHPSIRLFVTHGGQNSVMEAIRHGVPMVGLPVNG 
DQHGNMVRWAKN YGVS I RLNQVTADTLTLTMKQVI EDKRYKSAWAASVI LHSQPLS 
PAQRLVGWIDHILQTGGATHLKPYAFQQPWHEQYLIDVFVFLLGLTLGTMWLCGKLLG 
WARWLRGARKVKKT " 



ORIGIN 



Query Match 55.2%; Score 1281.6; DB 6; Length 2082; 

Best Local Similarity 85.9%; Pred. No. 0; 

Matches 1422; Conservative 0; Mismatches 234; Indels 0; Gaps 



0; 



Qy 



7 CCTTAGCCGGGCGCAGGGCGCGCAGCCCAGGCTGAGATCCGCGGCTTCCGTAGAAGTGAG 66 



I I I I I I M i I I I I I I I I I I I I M I I I I I I I M I I I I I I I I I I I I I I II I I I I II I I 
84 CCTTAGCCCGGCGCAGGGCGCGCAGCCCAGGCTGAGATCCGCTGCTTCTGTGGAAGTGAG 143 

67 CATGGCTGGGCAGCGAGTGCTTCTTCTAGTGGGCTTCCTTCTCCCTGGGGTCCTGCTCTC 126 
I M I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
144 CATGGTTGGGCAGCGGGTGCTGCTTCTAGTGGCCTTCCTTCTTTCTGGGGTCCTGCTCTC 203 

127 AGAGGCTGCCAAAATCCTGACAATATCTACAGTAGGTGGAAGCCATTATCTACTGATGGA 186 

I I I I M I I I I I I I I I I II I I I I I I I I I I M I I I I I I I I I I I II I I I I M I I I I I I I 
204 AGAGGCTGCCAA/^TCCTGACAATATCTACACTGGGTGGAAGCCATTACCTACTGTTGGA 263 

187 CCGGGTTTCTCAGATTCTTCAAGATCACGGTCATAATGTCACCATGCTTAACCACAAAAG 246 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I II I I I I I I I II I I 
264 CCGGGTGTCTCAGATTCTTCAAGAGCATGGTCATAATGTGACTATGCTTCATCAGAGTGG 323 

247 AGGTCCTTTTATGCCAGATTTTAAAAAGGAAGAAAAATCATATCAAGTTATCAGTTGGCT 306 

I III II I I M II Mill MM M II II II I II II II I I I M II Ml I 

324 AAAGTTTTTGATCCCAGATATTAAAGAGGAGGAAAAATCATACCAAGTTATCAGGTGGTT 383 

307 TGCACCTGAAGATCATCAAAGAGAATTTAAAAAGAGTTTTGATTTCTTTCTGGAAGAAAC 366 

I II II II II I II I II II II I I II II II I II II II I II I II I II 

384 TTCACCTGAAGATCATCAAAAAAGAATTAAGAAGCATTTTGATAGCTACATAGAAACAGC 443 

367 TTTAGGTGGCAGAGGAAAATTTGAAAACTTATTAAATGTTCTAGAATACTTGGCGTTGCA 426 
II I M M I II I II I II M II II I I I I II I MM II 

444 ATTGGATGGCAGAAAAGAATCTGAAGCCCTTGTAAAGCTAATGGAAATATTTGGGACTCA 503 

427 GTGCAGTCATTTTTTAAATAGAAAGGATATCATGGATTCCTTAAAGAATGAGAACTTCGA 486 

II III II I I III II II II II II I II II M II II I II II II II I M II I M 
504 ATGTAGTTATTTGCTAAGCAGAAAGGATATAATGGATTCCTTAAAGAATGAGAACTATGA 563 

487 CATGGTGATAGTTGAAACTTTTGACTACTGTCCTTTCCTGATTGCTGAGAAGCTTGGGAA 546 

II I I I MUM I II II I I II II M I II II M I I M II I II II I I II III 
564 TCTGGTATTTGTTGAAGCATTTGATTTCTGTTCTTTCCTGATTGCTGAGAAGCTTGTGAA 623 

547 GCCATTTGTGGCCATTCTTTCCACTTCATTCGGCTCTTTGGAATTTGGGCTACCAATCCC 606 

II II II I II II II II II I II II I II II II I II II II M II I II I I II II II III 
624 ACCATTTGTGGCCATTCTTCCCACCACATTCGGCTCTTTGGATTTTGGGCTACCAAGCCC 683 

607 CTTGTCTTATGTTCCAGTATTCCGTTCCTTGCTGACTGATCACATGGACTTCTGGGGCCG 666 

II II II I I I II II I I I I II II II II II II I II M II II I II II M II II II II II II I I 

684 CTTGTCTTATGTTCCAGTATTCCCTTCCTTGCTGACTGATCACATGGACTTCTGGGGCCG 743 

667 AGTGAAGAATTTTCTGATGTTCTTTAGTTTCTGCAGGAGGCAACAGCACATGCAGTCTAC 726 

II II II II II II II II II II II II II II I M I I II II I III I II I I II II I II I I 
744 AGTGAAGAATTTTCTGATGTTCTTTAGTTTCTCCAGGAGCCAATGGGACATGCAGTCTAC 803 

727 ATTTGACAACACCATCAAGGAACATTTCACAGAAGGCTCTAGGCCAGTTTTGTCTCATCT 786 

II II I I II II I II I I II I II I I I II II II II II I I II I I I II M II I I II I II II II I 
804 ATTTGACAACACCATCAAGGAGCATTTCCCAGAAGGCTCTAGGCCAGTTTTGTCTCATCT 863 

787 TCTACTGAAAGCAGAGTTGTGGTTCATTAACTCTGACTTTGCCTTTGATTTTGCTCGACC 846 

II II II II I II II II II II II II I I II II II II I I II II II I I II I I II II II II 
864 TCTACTGAAAGCAGAGTTGTGGTTTGTTAACTCTGATTTTGCCTTTGATTTTGCCCGGCC 923 

847 TCTGCTTCCCAACACTGTTTATGTTGGAGGCTTGATGGAAAAACCTATTAAACCAGTACC 906 
II II I II I II I II M II II II II II I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I 



Db 924 CCTGCTTCCCAACACTGTTTATATTGGAGGCTTGATGGAAAAACCTATTAAACCAGTACC 983 

Qy 907 ACAAGACTTGGAGAACTTCATTGCCAAGTTTGGGGACTCTGGTTTTGTCCTTGTGACCTT 966 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I 
Db 984 ACAAGACTTGGACAACTTCATTGCCAACTTTGGGGATGCAGGGTTTGTCCTTGTGGCCTT 104 3 

Qy 967 GGGCTCCATGGTGAACACCTGTCAGAATCCGGAAATCTTCAAGGAGATGAACAATGCCTT 1026 

I M I I I I I I I I I I II II II II I I I I I II I I I I I I I II I I I I I I I II II 

Db 1044 TGGCTCCATGTTGAACACCCATCAGTCCCAGGAAGTCCTCAAGAAGATGCACAATGCCTT 1103 

Qy 1027 TGCTCACCTACCCCAAGGGGTGATATGG7SAGTGTCAGTGTTCTCATTGGCCCAAAGATGT 1086 

III 1 1 1 1 1 II mil 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1104 TGCCCACCTCCCTCAAGGAGTGATATGGACATGTCAGAGTTCTCATTGGCCCAGAGATGT 1163 

Qy 1087 CCACCTGGCTGCAAATGTG75J\AATTGTGGACTGGCTTCCTCAGAGTGACCTCCTGGCTCA 1146 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I 

Db 1164 TCATTTGGCCACAAATGTGAAAATTGTGGACTGGCTTCCTCAGAGTGACCTCCTGGCTCA 1223 

Qy 1147 CCCAAGCATCCGTCTGTTTGTCACCCACGGCGGGCAGAATAGCATAATGGAGGCCATCCA 1206 

I II I I I I I I M I M M I I I I M II II I I I I I I I I III I I I I I I I I I I I I I I I 
Db 1224 CCCCAGCATCCGTCTTTTTGTCACTCATGGTGGGCAGAACAGCGTAATGGAGGCCATCCG 1283 

Qy 1207 GCATGGTGTGCCCATGGTGGGGATCCCTCTCTTTGGAGACCAGCCTGAAAACATGGTCCG 1266 

I I I I I I I I I I I I I I I I I I I I I II II I I I II I I I I I I II I I I I I I I M I I I 
Db 1284 TCATGGTGTGCCCATGGTGGGATTACCAGTCAATGGAGACCAGCATGGAAACATGGTCCG 1343 

Qy 1267 AGTAGAAGCC7VAAAAGTTTGGTGTTTCTATTCAGTTAAAGAAGCTCAAGGCAGAGACATT 1326 

I I I M M I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I M I 
Db 134 4 AGTAGTAGCCAAAAATTATGGTGTCTCTATCCGGTTGAATCAGGTCACAGCCGACACACT 1403 

Qy 1327 GGCTCTTAAGATGAAACAAATCATGGAAGACAAGAGATACAAGTCCGCGGCAGTGGCTGC 1386 

I I I I I I I I I I I I I I I I I I I I I i I I I I I I i I I I I I I I I I II I Mill II 
Db 1404 GACACTTACAATGAAACAAGTCATAGAAGACAAGAGGTACAAGTCGGCAGTGGTGGCAGC 1463 

Qy 1387 CAGTGTCATCCTGCGCTCCCACCCGCTCAGCCCCACACAGCGGCTGGTGGGCTGGATTGA 1446 

I I I I I I I I I II I I I III II II II I I I I II I I I I I I I I I I I I I I I II I I I I I II 
Db 14 64 CAGTGTCATCCTGCACTCTCAGCCCCTGAGCCCCGCACAGCGGCTGGTGGGCTGGATCGA 1523 

Qy 1447 CCACGTCCTCCAGACAGGGGGCGCGACGCACCTCAAGCCCTATGTCTTTCAGCAGCCCTG 1506 

I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I II 
Db 1524 CCACATCCTCCAGACTGGGGGAGCGACGCACCTCAAGCCCTATGCCTTCCAGCAGCCTTG 1583 

Qy 1507 GCATGAGCAGTACCTGTTCGACGTTTTTGTGTTTCTGCTGGGGCTCACTCTGGGGACTCT 1566 

I I II I I I I I I I I I I I I II II I I I I I I I I I I M I I I I I I II I I I I I I I I I III I 
Db 1584 GCATGAGCAGTACCTCATTGATGTCTTTGTGTTTCTGCTGGGGCTCACTCTGGGCACTAT 1643 

Qy 1567 ATGGCTTTGTGGGAAGCTGCTGGGCATGGCTGTCTGGTGGCTGCGTGGGGCCAGAAAGGT 1626 

I I I I I I I I I II I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1644 GTGGCTTTGTGGGAAGCTGCTGGGTGTGGTGGCCAGGTGGCTGCGTGGGGCCAGGAAGGT 1703 

Qy 1627 GAAGGAGACATAAGGCCAGGTGCAGCCTTGGCGGGG 1662 

I I I I I I I I I I I- 1 I I I I I I I I I I I I I M II 
Db 1704 GAAGAAGACATGAGGCTAGGTGTAGCCTTGGGTGAG 1739 



RESULT 9 
AX155211 



LOCUS AX155211 2797 bp DNA linear PAT 22-JUN-2001 

DEFINITION Sequence 1 from Patent WO0138505. 

ACCESSION AX155211 

VERSION AX155211.1 GI: 14536690 

KEYWORDS 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Crania ta; Vertebrata; Euteleostomi; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 

AUTHORS Adler^D.A., Dong^D.L.^ Pownder^S., Gao,Z. and Conklin^D.C. 

TITLE Secreted protein, zalpha37 

JOURNAL Patent: WO 0138505-A 1 31-MAY-2001; 
ZymoGenetics, Inc. (US) 
FEATURES Location/Qualifiers 
source 1. .2797 

/organism="Homo sapiens" 
/mo l_type=" Unas signed DNA" 
/db_xref ="taxon : 9606" 
CDS 119. .1690 

/note=" unnamed protein product" 
/codon_start=l 
/protein_id="CAC42674 .1" 
/db_xref="GI : 14536691" 

/translation="MVGQRVLLLVAFLLSGVLLSEAAKILTISTLGGSHYLLLDRVSQ 
ILQEHGHNVTMLHQSGKFLIPDIKEEEKSYQVIRWFSPEDHQKRIKKHFDSYIETALD 
GRKESEALVKLMEIFGTQCSYLLSRKDIMDSLKNENYDLVFVEAFDFCSFLIAEKLVK 
PFVAILPTTFGSLDFGLPSPLSYVPVFPSLLTDHMDFWGRVKNFLMFFSFSRSQWDMQ 
STFDNTIKEHFPEGSRPVLSHLLLKAELWFVNSDFAFDFARPLLPNTVYIGGLMEKPI 
KPVPQDLDNFIANFGDAGFVLVAFGSMLNTHQSQEVLKKMHNAFAHLPQGVIWTCQSS 
HWPRDVHLATNVKIVDWLPQSDLLAHPSIRLFVTHGGQNSVMEAIRHGVPMVGLPVNG 
DQHGNMVRWAKNYGVS I RLNQVTADTLTLTMKQVI EDKRYKS AWAASVI LHSQPLS 
PAQRLVGWI DHI LQTGGATHLKPYAFQQPWHEQYLI DVFVFLLGLTLGTMWLCGKLLG 
WARWLRGARKVKKT " 

ORIGIN 

Query Match 55.2%; Score 1281.6; DB 6; Length 2797; 

Best Local Similarity 85.9%; Pred. No. 0; 

Matches 1422; Conservative 0; Mismatches 234; Indels 0; Gaps 0; 



Qy 7 CCTTAGCCGGGCGCAGGGCGCGCAGCCCAGGCTGAGATCCGCGGCTTCCGTAGAAGTGAG 66 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 

Db 58 CCTTAGCCCGGCGCAGGGCGCGCAGCCCAGGCTGAGATCCGCTGCTTCTGTGGAAGTGAG 117 

Qy 67 CATGGCTGGGCAGCGAGTGCTTCTTCTAGTGGGCTTCCTTCTCCCTGGGGTCCTGCTCTC 126 

Mill I I I II 14^14 I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I 

Db 118 CATGGTTGGGCAGCGGGTGCTGCTTCTAGTGGCCTTCCTTCTTTCTGGGGTCCTGCTCTC 177 

Qy 127 AGAGGCTGCCAAAATCCTGACAATATCTACAGTAGGTGGAAGCCATTATCTACTGATGGA 186 

I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 

Db 178 AGAGGCTGCCAAAATCCTGACAATATCTACACTGGGTGGAAGCCATTACCTACTGTTGGA 237 

Qy 187 CCGGGTTTCTCAGATTCTTCAAGATCACGGTCATAATGTCACCATGCTTAACCACAAAAG 246 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I II I I 

Db 238 CCGGGTGTCTCAGATTCTTCAAGAGCATGGTCATAATGTGACTATGCTTCATCAGAGTGG 297 

Qy 247 AGGTCCTTTTATGCCAGATTTTA/SkAAAGGAAGAAAAATCATATC/AGTTATCAGTTGGCT 306 



Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

-Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 



298 



307 



358 



367 



418 



427 



478 



487 



538 



547 



598 



607 



658 



667 



718 



727 



778 



787 



838 



847 



898 



907 



958 



967 



I III II I II I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I III I 

AAAGTTTTTGATCCCAGATATTAAAGAGGAGGAAAAATCATACCAAGTTATCAGGTGGTT 357 

TGCACCTGAAGATCATCAAAGAGAATTTAAAAAGAGTTTTGATTTCTTTCTGGAAGAAAC 366 
I I I I I I I I I I I I I.I I I I I I I I I M I I I I I I I I I I I II I I I I II 
TTCACCTGAAGATCATCAAAAAAGAATTAAGAAGCATTTTGATAGCTACATAGAAACAGC 417 

TTTAGGTGGCAGAGGA/WS^TTTGAAAACTTATTAAATGTTCTAGAATACTTGGCGTTGCA 426 

I I I I I I I I I I I I II II I I II I I I I I I I II I I I I II 
ATTGGATGGCAGAAAAGAATCTGAAGCCCTTGTAAAGCTAATGGAAATATTTGGGACTCA 477 

GTGCAGTCATTTTTTAAATAGAAAGGATATCATGGATTCCTTAAAGAATGAGAACTTCGA 486 

II III I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
ATGTAGTTATTTGCTAAGCAGAAAGGATATAATGGATTCCTTAAAGAATGAGAACTATGA 537 

CATGGTGATAGTTGAAACTTTTGACTACTGTCCTTTCCTGATTGCTGAGAAGCTTGGGAA 546 

I I I I I I I I I I I I Mill I I I I I I I I M I I M I I I I II M I I II II I III 

TCTGGTATTTGTTGAAGCATTTGATTTCTGTTCTTTCCTGATTGCTGAGAAGCTTGTGAA 597 

GCCATTTGTGGCCATTCTTTCCACTTCATTCGGCTCTTTGGAATTTGGGCTACCAATCCC 606 

I I I I I I 1 I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I III 
ACCATTTGTGGCCATTCTTCCCACCACATTCGGCTCTTTGGATTTTGGGCTACCAAGCCC 657 

CTTGTCTTATGTTCCAGTATTCCGTTCCTTGCTGACTGATCACATGGACTTCTGGGGCCG 666 
I M I I I I I I I I I M I I I I I I I I I I I I I M I I I I II I II I I I I I I I I I M I I I I I I II I I 
CTTGTCTTATGTTCCAGTATTCCCTTCCTTGCTGACTGATCACATGGACTTCTGGGGCCG 717 

AGTGAAGAATTTTCTGATGTTCTTTAGTTTCTGCAGGAGGCAACAGCACATGCAGTCTAC 726 
I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I 
AGTGAAGAATTTTCTGATGTTCTTTAGTTTCTCCAGGAGCCAATGGGACATGCAGTCTAC 777 

ATTTGACAACACCATCAAGGAACATTTCACAGAAGGCTCTAGGCCAGTTTTGTCTCATCT 786 
M I I I I I I I I I I I I I I I I I M I I I I I I I I M I I M I I I I I I I M I M I I I I I I I I M I 

ATTTGACAACACCATCAAGGAGCATTTCCCAGAAGGCTCTAGGCCAGTTTTGTCTCATCT 837 

TCTACTGAAAGCAGAGTTGTGGTTCATTAACTCTGACTTTGCCTTTGATTTTGCTCGACC 846 
I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I II II 
TCTACTGAAAGCAGAGTTGTGGTTTGTTAACTCTGATTTTGCCTTTGATTTTGCCCGGCC 897 

TCTGCTTCCCAACACTGTTTATGTTGGAGGCTTGATGGAAAAACCTATTAAACCAGTACC 906 

I I I I I I I M I I I I I I I I M I I I I I I II I M I II I I I I I I I I I I I M I I I I I I I I I I I I 
CCTGCTTCCCAACACTGTTTATATTGGAGGCTTGATGGAAAAACCTATTAAACCAGTACC 957 

ACAAGACTTGGAGAACTTCATTGCCAAGTTTGGGGACTCTGGTTTTGTCCTTGTGACCTT 966 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
ACAAGACTTGGACAACTTCAT.XGCCAACTTTGGGGATGCAGGGTTTGTCCTTGTGGCCTT 1017 



GGGCTCCATGGTGAACACCTGTCAGAATCCGGAAATCTTCAAGGAGATGAACAATGCCTT 1026 

I M I I I I M I I I I I I M I I I I I MM II Mill I II I I II I II I I II I 

1018 TGGCTCCATGTTGAACACCCATCAGTCCCAGGAAGTCCTCAAGAAGATGCACAATGCCTT 1077 

1027 TGCTCACCTACCCCAAGGGGTGATATGGAAGTGTCAGTGTTCTCATTGGCCCAAAGATGT 1086 

III Mill II Mill II II II II II II II II II II II II II II II I II II II 
1078 TGCCCACCTCCCTCAAGGAGTGATATGGACATGTCAGAGTTCTCATTGGCCCAGAGATGT 1137 

1087 CCACCTGGCTGCAAATGTGAAAATTGTGGACTGGCTTCCTCAGAGTGACCTCCTGGCTCA 1146 

II II II 11 II I II II II II II I II II I I II II II II II II II I II I II II I II I I 



Db 



1138 TCATTTGGCCACAAATGTGAAAATTGTGGACTGGCTTCCTCAGAGTGACCTCCTGGCTCA 1197 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



1147 CCCAAGCATCCGTCTGTTTGTCACCCACGGCGGGCAGAATAGCATAATGGAGGCCATCCA 1206 

III I II I I I II I I I I I I I I I I I II II I I I I I I I I III I I I I I I I I I I II I I I 
1198 CCCCAGCATCCGTCTTTTTGTCACTCATGGTGGGCAGAACAGCGTAATGGAGGCCATCCG 1257 

1207 GCATGGTGTGCCCATGGTGGGGATCCCTCTCTTTGGAGACCAGCCTGAAAACATGGTCCG 1266 

I I I I I I n I I I I I I I I I II I I II II I I I I I I I I II I II I I I M I I I I I I I 
1258 TCATGGTGTGCCCATGGTGGGATTACCAGTC/ikATGGAGACCAGCATGGAAACATGGTCCG 1317 

1267 AGTAGAAGCCAAAAAGTTTGGTGTTTCTATTCAGTTAAAGAAGCTCAAGGCAGAGACATT 1326 

II II I I II I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
1318 AGTAGTAGCCAAAAATTATGGTGTCTCTATCCGGTTGAATCAGGTCACAGCCGACACACT 1377 

1327 GGCTCTTAAGATGAAACAAATCATGGAAGACAAGAGATACAAGTCCGCGGCAGTGGCTGC 1386 

I I I I I I II I II I I II I I II I I I II II 11 I I I I II I II I I I I I I I I I M 
1378 GACACTTACAATGAAACAAGTCATAGAAGACAAGAGGTACAAGTCGGCAGTGGTGGCAGC 1437 

1387 CAGTGTCATCCTGCGCTCCCACCCGCTCAGCCCCACACAGCGGCTGGTGGGCTGGATTGA 1446 

I I I I II II I I II I I Ml II II II MINI I II I I I I M I I I I M I M I I I I M 

1438 CAGTGTCATCCTGCACTCTCAGCCCCTGAGCCCCGCACAGCGGCTGGTGGGCTGGATCGA 1497 

1447 CCACGTCCTCCAGACAGGGGGCGCGACGCACCTCAAGCCCTATGTCTTTCAGCAGCCCTG 1506 

INI I I I I I I I I I I Mill I II I I II I I II I II I I I I I I I I III I I I I I I I I II 
1498 CCACATCCTCCAGACTGGGGGAGCGACGCACCTCAAGCCCTATGCCTTCCAGCAGCCTTG 1557 

1507 GCATGAGCAGTACCTGTTCGACGTTTTTGTGTTTCTGCTGGGGCTCACTCTGGGGACTCT 1566 

II M I I II I I I II I I I M II I M M I M I M M I I I I II M I II I I M I III I 
1558 GCATGAGCAGTACCTCATTGATGTCTTTGTGTTTCTGCTGGGGCTCACTCTGGGCACTAT 1617 

1567 ATGGCTTTGTGGGAAGCTGCTGGGCATGGCTGTCTGGTGGCTGCGTGGGGCCAGAAAGGT 1626 

II I I II II M I II I II II II II I Ml I I II I I I I I I I M I I M I M I Mill 

1618 GTGGCTTTGTGGGAAGCTGCTGGGTGTGGTGGCCAGGTGGCTGCGTGGGGCCAGGAAGGT 1677 

1627 GAAGGAGACATAAGGCCAGGTGCAGCCTTGGCGGGG 1662 

II II MUM I II I Mill M II M II I I 
1678 GAAGAAGACATGAGGCTAGGTGTAGCCTTGGGTGAG 1713 
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AX714604 2823 bp DNA linear PAT 15-APR-2003 

Sequence 1288 from Patent EP1293569. 

AX714604 

AX714604 . 1 GI : 29889557 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 

Isogai^T.^ Sugiyama^T., Otsuki,T.^ Wakamatsu, A. , Sato,H., Ishii^S., 
Yamamoto, J. I . , Isono^Y., Hio^Y., Otsuka,K., Nagai,K.^ Irie^R., 
Tamechika, I . , Seki,N., Yoshikawa,T. , Otsuka,M., Nagahari,K. and 
Masuho, Y. 
Full-length cDNAs 

Patent: EP 1293569-A 1288 19-MAR-2003; 



Helix Research Institute (JP) ; Research Association for 
Biotechnology (JP) 
FEATURES Location/Qualifiers 
source 1. .2823 

/organism="Homo sapiens" 
/mol_type="unas signed DNA" 
/db_xref="taxon: 9606" 

ORIGIN 

Query Match 55.2%; Score 1281.6; DB 6; Length 2823; 

Best Local Similarity 85.9%; Pred. No. 0; 

Matches 1422; Conservative 0; Mismatches 234; Indels 0; Gaps 0; 



Qy • 7 CCTTAGCCGGGCGCAGGGCGCGCAGCCCAGGCTGAGATCCGCGGCTTCCGTAGAAGTGAG 66 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 97 CCTTAGCCCGGCGCAGGGCGCGCAGCCCAGGCTGAGATCCGCTGCTTCTGTGGAAGTGAG 156 

Qy 67 CATGGCTGGGCAGCGAGTGGTTCTTCTAGTGGGCTTCCTTCTCCCTGGGGTCCTGCTCTC 126 

I II II I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 157 CATGGTTGGGCAGCGGGTGCTGCTTCTAGTGGCCTTCCTTCTTTCTGGGGTCCTGCTCTC 216 

Qy 127 AGAGGCTGCCAAAATCCTGACAATATCTACAGTAGGTGGAAGCCATTATCTACTGATGGA, 186 

I II I I II I M I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 217 AGAGGCTGCCAAAATCCTGACAATATCTACACTGGGTGGAAGCCATTACCTACTGTTGGA 276 

Qy 187 CCGGGTTTCTCAGATTCTTCAAGATCACGGTCATAATGTCACCATGCTTAACCACAAAAG 246 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I II I I 
Db 277 CCGGGTGTCTCAGATTCTTCAAGAGCATGGTCATAATGTGACTATGCTTCATCAGAGTGG 336 

Qy 247 AGGTCCTTTTATGCCAGATTTTAAAAAGGAAGAAAAATCATATCAAGTTATCAGTTGGCT 306 * 

I III II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I 

Db 337 AAAGTTTTTGATCCCAGATATTAAAGAGGAGGAAAAATCATACCAAGTTATCAGGTGGTT 396 

Qy 307 TGCACCTGAAGATCATCAAAGAGAATTTAAAAAGAGTTTTGATTTCTTTCTGGAAGAAAC 366 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II 
Db 397 TTCACCTGAAGATCATCAAAAAAGAATTAAGAAGCATTTTGATAGCTACATAGAAACAGC 456 

Qy 367 TTTAGGTGGCAGAGGAAAATTTGAAAACTTATTAAATGTTCTAGAATACTTGGCGTTGCA 426 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II 

Db 457 ATTGGATGGCAGAAAAGAATCTGAAGCCCTTGTAAAGCTAATGGAAATATTTGGGACTCA 516 

Qy 427 GTGCAGTCATTTTTTAAATAGAAAGGATATCATGGATTCCTTAAAGAATGAGAACTTCGA 486 

II III I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 517 ATGTAGTTATTTGCTAAGCAGAAAGGATATAATGGATTCCTTAAAGAATGAGAACTGTGA 576 

Qy„...... 487 CATGGTGATAGTTGAAACTTTTGACTACTGTCCTTTCCTGAT.TGCTGAGAAGCTTGGGAA 546 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 
Db 577 TCTGGTATTTGTTGAAGCATTTGATTTCTGTTCTTTCCTGATTGCTGAGAAGCTTGTGAA 636 

Qy 547 GCCATTTGTGGCCATTCTTTCCACTTCATTCGGCTCTTTGGAATTTGGGCTACCAATCCC 606 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 
Db 637 ACCATTTGTGGCCATTCTTCCCACCACATTCGGCTCTTTGGATTTTGGGCTACCAAGCCC 696 

Qy 607 CTTGTCTTATGTTCCAGTATTCCGTTCCTTGCTGACTGATCACATGGACTTCTGGGGCCG 666 

I II I I I I I I I I I II II I I I I I I I I I I I I I I I I I M I I I I I I I I II I I I I I I I I I I I I I I 
Db 697 CTTGTCTTATGTTCCAGTATTCCCTTCCTTGCTGACTGATCACATGGACTTCTGGGGCCG 756 



Qy 667 AGTGAAGAATTTTCTGATGTTCTTTAGTTTCTGCAGGAGGCAACAGCACATGCAGTCTAC 726 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I III I I I I I I I I I I I I I I 

Db 757 AGTGAAGAATTTTCTGATGTTCTTTAGTTTCTCCAGGAGCCAATGGGACATGCAGTCTAC 816 

Qy 727 ATTTGACAACACCATCAAGGAACATTTCACAGAAGGCTCTAGGCCAGTTTTGTCTCATCT 786 

II II I I I I I I I I II I I I II II II I ll.l I M I I I I I II I I I I I I I I I I I I I I I I I I I I I 

Db 817 ATTTGACAACACCATCAAGGAGCATTTCCCAGAAGGCTCTAGGCCAGTTTTGTCTCATCT 876 

Qy 787 TCTACTGAAAGCAGAGTTGTGGTTCATTAACTCTGACTTTGCCTTTGATTTTGCTCGACC 846 

I I I I I I I I LI I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II II 
Db 877 TCTACTGAAAGCAGAGTTGTGGTTTGTTAACTCTGATTTTGCCTTTGATTTTGCCCGGCC 936 

Qy 847 TCTGCTTCCCAACACTGTTTATGTTGGAGGCTTGATGGAAAAACCTATTAAACCAGTACC 906 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II II II II I II I II I II I II I II I 
Db 937 CCTGCTTCCCAACACTGTTTATATTGGAGGCTTGATGGAA7WVCCTATTAAACCAGTACC 996 

Qy 907 ACAAGACTTGGAGAACTTCATTGCCAAGTTTGGGGACTCTGGTTTTGTCCTTGTGACCTT 966 

I I I I I I I I I M I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 

Db 997 ACAAGACTTGGACAACTTCATTGCCAACTTTGGGGATGCAGGGTTTGTCCTTGTGGCCTT 1056 

Qy 967 GGGCTCCATGGTGAACACCTGTCAGAATCCGGAAATCTTCAAGGAGATGAACAATGCCTT 1026 

I II I I I I I I I I I I I I I I I I I I I MM II I I I I I I II I I II II II I II I 

Db 1057 TGGCTCCATGTTGAACACCCATCAGTCCCAGGAAGTCCTCAAGAAGATGCACAATGCCTT 1116 

Qy 1027 TGCTCACCTACCCCAAGGGGTGATATGGAAGTGTCAGTGTTCTCATTGGCCCAAAGATGT 1086 

III II II I II I I I II I II II I I I I I I II II I I II I I I I I I II I II I I I I I II 

Db 1117 TGCCCACCTCCCTCAAGGAGTGATATGGACATGTCAGAGTTCTCATTGGCCCAGAGATGT 1176 

Qy 1087 CCACCTGGCTGCAAATGTGAAAATTGTGGACTGGCTTCCTCAGAGTGACCTCCTGGCTCA 1146 

II I I II II I II II II II I II I II I II I II II I I I II I II I I I II II II I I I I II I 

Db 1177 TCATTTGGCCACAAATGTGAAAATTGTGGACTGGCTTCCTCAGAGTGACCTCCTGGCTCA 1236 

Qy 1147 CCCAAGCATCCGTCTGTTTGTCACCCACGGCGGGCAGAATAGCATAATGGAGGCCATCCA 1206 

III I I II I II I I I I II II II II II II II I II I I I III I I I I II II II I II I 
Db 1237 CCCCAGCATCCGTCTTTTTGTCACTCATGGTGGGCAGAACAGCGTAATGGAGACCATCCG 1296 

Qy 1207 GCATGGTGTGCCCATGGTGGGGATCCCTCTCTTTGGAGACCAGCCTGAAAACATGGTCCG 1266 

II II II I II I II II I II II I I II II I II II I II I II II II II I I II I II I 
Db 1297 TCATGGTGTGCCCATGGTGGGATTACCAGTCAATGGAGACCAGCATGGAAACATGGTCCG 1356 

Qy 1267 AGTAGAAGCCAAAAAGTTTGGTGTTTCTATTCAGTTAAAGAAGCTCAAGGCAGAGACATT 1326 

II II I I I I II I I I I I II I I II II II I I I I I II I I I II I I II I II I 

Db 1357 AGTAGTAGCCAAAAATTATGGTGTCTCTATCCGGTTGAATCAGGTCACAGCCGACACACT 1416 

Qy 1327 GGCTCTTAAGATGAAACAAATCATGGAAGACAAGAGATACAAGTCCGCGGCAGTGGCTGC 1386 

•^..U I II II I M II I II I II II I II I II I II II II I I II II II l.^^l II II II 
Db 1417 GACACTTACAATGAAACAAGTCATAGAAGACAAGAGGTACAAGTCGGCAGTGGTGGCAGC 1476 

Qy 1387 CAGTGTCATCCTGCGCTCCCACCCGCTCAGCCCCACACAGCGGCTGGTGGGCTGGATTGA 1446 

I I I I I I II I I I I II III II II II I I I I I I II II I II I I I I I I I I II I I I I I II 
Db 1477 CAGTGTCATCCTGCACTCTCAGCCCCTGAGCCCCGCACAGCGGCTGGTGGGCTGGATCGA 1536 

Qy 1447 CCACGTCCTCCAGACAGGGGGCGCGACGCACCTCAAGCCCTATGTCTTTCAGCAGCCCTG 1506 

MM I II II I II I I II II I II II II I II I I II II II I II II I II I I II I II I I II 
Db 1537 CCACATCCTCCAGACTGGGGGAGCGACGCACCTCAAGCCCTATGTCTTCCAGCAGCCTTG 1596 

Qy 1507 GCATGAGCAGTACCTGTTCGACGTTTTTGTGTTTCTGCTGGGGCTCACTCTGGGGACTCT 1566 



Db 



I I I I I I I I I I I I i I I I II II I I I I I II I I I I I II I II I I I I I I I M I II III I 
1597 GCATGAGCAGTACCTCATTGATGTCTTTGTGTTTCTGCTGGGGCTCACTCTGGGCACTAT 1656 



Qy 1567 ATGGCTTTGTGGGAAGCTGCTGGGCATGGCTGTCTGGTGGCTGCGTGGGGCCAGAAAGGT 1626 

I I II II II I II I I II I I I I I II I III I I II II I I I II I I I I I I I II I I I I I I 
Db 1657 GTGGCTTTGTGGGAAGCTGCTGGGTGTGGTGGCCAGGTGGCTGCGTGGGGCCAGGAAGGT 1716 

Qy 1627 GAAGGAGACATAAGGCCAGGTGCAGCCTTGGCGGGG 1662 

MM II II M I I II I II II II I I I I II I I 
Db 1717 GAAGAAGACATGAGGCTAGGTGTAGCCTTGGGTGAG 1752 
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AK057066 2823 bp ' iriRNA linear PRI 30-JAN-2004 

Homo sapiens cDNA FLJ32504 fis, clone SMINT1000016, weakly similar 
to 2-HYDROXYACYLSPHINGOSINE 1- BETA- GALACTOSYLT RAN SFERASE PRECURSOR 
(EC 2.4.1.45) . 
AK057066 

AK057066 .1 GI : 16552643 

oligo capping; fis (full insert sequence) . 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 

Ota^T., Suzuki^ Y., Nishikawa, T . , Otsuki,T., Sugiyama^T., Irie,R., 
Wakamatsu, A. , Hayashi^K., Sato,H., Nagai^K.^ Kimura,K., Makita^H., 
Sekine,M., Obayashi^M., Nishi,T., Shibahara, T . , Tanaka,T.^ 
Ishii^S., Yamamoto,J.^ Saito,K., Kawai,Y., Isono^Y., Nakamura,Y., 
Nagahari,K., Murakami, K., Yasuda,T., Iwayanagi, T. , Wagatsuma^M. , 
Shiratori, A. , Sudo,H., Hosoiri,T., Kaku^Y., Kodaira^H., Kondo^H., 
Sugawara,M., Takahashi, M. , Kanda^K., Yokoi^T., Furuya^T., 
Kikkawa^E., Omura^Y., Abe^K., Kamihara,K., Katsuta^N., Sato^K., 
Tanikawa,M. , Yamazaki^M., Ninomiya,K., Ishibashi^ T . , Yamashita, H. , 
Murakawa^K., Tujimori^K., Tanai^H., Kimata^M., Watanabe^M., 
Hiraoka^S., Chiba^Y., Ishida,S., Ono^Y., Takiguchi, S . , Watanabe^S., 
Yosida^M., Hotuta,T., Kusano,J., Kanehori,K.^ Takahashi-Fujii, A. , 
Hara,H., Tanase^T., Nomura, Y., Togiya^S., Komai,F., Hara,R., 
Takeuchi,K., Arita^M., Imose,N., Musashino, K. , Yuuki,H., Oshima,A. , 
Sasaki, N., Aotsuka,S., Yoshikawa, Y. , Matsunawa, H . , Ichihara,T., 
Shiohata,N., Sano,S., Moriya,S., Momiyama,H.^ Satoh,N., Takami,S., 
Terashima, Y. , Suzuki, O., Nakagawa,S., Senoh,A. , Mizoguchi, H. , 
Goto,Y., Shimizu,F., Wakebe,H., Hishigaki, H. , Watanabe,T., 
Sugiyaraa,A. , Takemoto,M., Kawakami,B., Yamazaki,M., Watanabe,K., 
Kumagai, A. , . Itakura, S. , Fukuzumi,Y., Fujimori, Y., Komiyama,M., 
Tashiro,H., Tanigami,A. , Fujiwara,T., Ono,T., Yamada,K., Fujii,Y., 
Ozaki,K., Hirao,M., Ohmori,Y,, Kawabata,A., Hikiji,T., Kobatake,N., 
Inagaki,H., Ikema,Y., Okamoto,S., Okitani,R., Kawakami,T., 
Noguchi,S., Itoh,T., Shigeta,K., Senba,T., Matsumura, K. , 
Nakajima,Y., Mizuno,T., Morinaga,M., Sasaki, M., Togashi,T., 
Oyama,M., Hata,H., Watanabe,M., Komatsu,T., Mizushima-Sugano, J. , 
Satoh,T., Shirai,Y., Takahashi, Y. , Nakagawa,K., Okumura,K., 
Nagase,T., Nomura, N., Kikuchi,H., Masuho,Y., Yamashita, R. , 
Nakai,K., Yada,T., Nakamura,Y., Ohara,0., Isogai,^T. and Sugano,S. 
Complete sequencing and characterization of 21,243 full-length 
human cDNAs 



JOURNAL 
PUBMED 
REFERENCE 
AUTHORS 



TITLE 
JOURNAL 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 



FEATURES 

source 



CDS 



Nat. Genet. 36 (1), 40-45 (2004) 
14702039 

2 

Kawakami,B., Sugiyama, A. , Takemoto,M. , Sugiyama^T. , Irie^R., 
Otsuki,T., Sato,H., Wakamatsu, A. , Ishii,S., Yamamoto, J. , Isono^Y., 
Kawai-Hio, Y. , Saito,K., Nishikawa, T. , Kimura.K.^ Yamashita, H - , 
Matsuo.K., Nakamura^ Y. , Sekine^M., Kikuchi,H., Kancia,K., 
Wagatsuma^M. , Murakawa^K., Kanehori,K., Takahashi-Fujii, A. , 
Oshima,A., Suzuki, Y., Sugano,S., Nagahari^K., Masuho,Y., Nagai,K. 
and Isogai^T. 

NEDO human cDNA sequencing project 

Unpublished 

3 (bases 1 to 2823) 

Isogai^T., Otsuki,T. and Sugiyama,T. 
Direct Submission 

Submitted (24-OCT-2001) Takao Isogai, Helix Research Institute, 
Genomics Laboratory; 1532-3 Yana, Kisarazu, Chiba 292-0812, Japan 
(E-mail:genomics@hri.co. jp, Tel : 81-438-52-3975, Fax: 81-438-52-3986) 
NEDO human cDNA sequencing project supported by Ministry of 
Economy, Trade and Industry of Japan; cDNA full insert sequencing: 
Research Association for Biotechnology (RAB) ; cDNA library 
construction: Helix Research Institute (HRI) (supported by Japan 
Key Technology Center etc.); 5'- & 3 '-end one pass sequencing: RAB, 
HRI, and Biotechnology Center, National Institute of Technology and 
Evaluation; clone selection for full insert sequencing: RAB and 
HRI. 

Location/Qualifiers 
1. .2823 

/organisra="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon:9606" 
/clone="SMINT1000016" 
/tissue__type=" small intestine" 
/ cl one_l ib= " SMI NT 1 " 
/note="cloning vector: pME18SFL3" 
158. .1729 

/note="unnamed protein product" 
/codon_start=l 
/protein_id="BAB71358 . 1" 
/db_xref ="GI : 16552644" 

/translation="MVGQRVLLLVAFLLSGVLLSEAAKILTISTLGGSHYLLLDRVSQ 
ILQEHGHNVTMLHQSGKFLIPDIKEEEKSYQVIRWFSPEDHQKRIKKHFDSYIETALD 
GRKESEALVKLMEIFGTQCSYLLSRKDIMDSLKNENCDLVFVEAFDFCSFLIAEKLVK 
PFVAILPTTFGSLDFGLPSPLSYVPVFPSLLTDHMDFWGRVKNFLMFFSFSRSQWDMQ 
STFDNTIKEHFPEGSRPVLSHLLLKAELWFVNSDFAFDFARPLLPNTVYIGGLMEKPI 
KPVPQDLDNFIANFGDAGFVLVAFGSMLNTHQSQEVLKKMHNAFAHLPQGVIWTCQSS 
HWPRDVHLATNVKIVDWLPQSDLLAHPSIRLFVTHGGQNSVMETIRHGVPMVGLPVNG 
DQHGNMVRWAKNYGVSIRLNQVTADTLTLTMKQVIEDKRYKSAWAASVILHSQPLS 
PAQRLVGWIDHILQTGGATHLKPYVFQQPWHEQYLIDVFVFLLGLTLGTMWLCGKLLG 
WARWLRGARKVKKT " 



ORIGIN 



Query Match 55.2%; Score 1281.6; DB 9; Length 2823; 

Best Local Similarity 85.9%; Pred. No. 0; 

Matches 1422; Conservative 0; Mismatches 234; Indels 0; Gaps 0; 



Qy 



7 CCTTAGCCGGGCGCAGGGCGCGCAGCCCAGGCTGAGATCCGCGGCTTCCGTAGAAGTGAG 66 



Db 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I 
97 CCTTAGCCCGGCGCAGGGCGCGCAGCCCAGGCTGAGATCCGCTGCTTCTGTGGAAGTGAG 156 



Qy 67 CATGGCTGGGCAGCGAGTGCTTCTTCTAGTGGGCTTCCTTCTCCCTGGGGTCCTGCTCTC 126 

II III I I I I I I I I I II I I I I I I I I I I I 11 I I I I I I I I I i II I I I I II I I I I II I 

Db 157 CATGGTTGGGCAGCGGGTGCTGCTTCTAGTGGCCTTCCTTCTTTCTGGGGTCCTGCTCTC 216 

Qy 127 AGAGGCTGCCAAAATCCTGACAATATCTACAGTAGGTGGAAGCCATTATCTACTGATGGA 186 

I I I II II II II I I I I I I I.I I M I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I II 

Db 217 AGAGGCTGCC7\AAATCCTGACAATATCTACACTGGGTGGAAGCCATTACCTACTGTTGGA 276 

Qy 187 CCGGGTTTCTCAGATTCTTCAAGATCACGGTCATAATGTCACCATGCTTAACCACAAAAG 246 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I II I I 

Db 277 CCGGGTGTCTCAGATTCTTCAAGAGCATGGTCATAATGTGACTATGCTTCATCAGAGTGG 336 

Qy 247 AGGTCCTTTTATGCCAGATTTTAAAAAGGAAGAAAAATCATATCAAGTTATCAGTTGGCT 306 

I I I I I I I II I I I I I I I I II I I I M I I I I I M I I I I I I I I I I I I I I I I . 

Db 337 AAAGTTTTTGATCCCAGATATTAAAGAGGAGGAAAAATCATACCAAGTTATCAGGTGGTT 396 

Qy 307 TGCACCTGAAGATCATCAAAGAGAATTTAAAAAGAGTTTTGATTTCTTTCTGGAAGAAAC 366 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II 

Db 397 TTCACCTGAAGATCATCAAAAAAGAATTAAGAAGCATTTTGATAGCTACATAGAAACAGC 456 

Qy 367 TTTAGGTGGCAGAGGAAAATTTGAAAACTTATTAAATGTTCTAGAATACTTGGCGTTGCA 426 

I I I M I I I I I I I M I I I I II I I I I I INI Mil II 

Db 457 ATTGGATGGCAGAAAAGAATCTGAAGCCCTTGTAAAGCTAATGGAAATATTTGGGACTCA 516 

Qy 427 GTGCAGTCATTTTTTAAATAGAAAGGATATCATGGATTCCTTAAAGAATGAGAACTTCGA 486 

II III I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 517 ATGTAGTTATTTGCTAAGCAGAAAGGATATAATGGATTCCTTAAAGAATGAGAACTGTGA 576 

Qy 487 CATGGTGATAGTTGAAACTTTTGACTACTGTCCTTTCCTGATTGCTGAGAAGCTTGGGAA 546 

I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I II I I I II II I I III 

Db 577 TCTGGtATTTGTTGAAGCATTTGATTTCTGTTCTTTCCTGATTGCTGAGAAGCTTGTGAA 636 

Qy 547 GCCATTTGTGGCCATTCTTTCCACTTCATTCGGCTCTTTGGAATTTGGGCTACCAATCCC 606 

I I I I I I I I I I II M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 

Db 637 ACCATTTGTGGCCATTCTTCCCACCACATTCGGCTCTTTGGATTTTGGGCTACCAAGCCC 696 

Qy 607 CTTGTCTTATGTTCCAGTATTCCGTTCCTTGCTGACTGATCACATGGACTTCTGGGGCCG 666 

I I I I I I I I I I III I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I M 

Db 697 CTTGTCTTATGTTCCAGTATTCCCTTCCTTGCTGACTGATCACATGGACTTCTGGGGCCG 756 

Qy 667 AGTGAAGAATTTTCTGATGTTCTTTAGTTTCTGCAGGAGGCAACAGCACATGCAGTCTAC 726 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I 

Db 757 AGTGAAGAATTTTCTGATGTTCTTTAGTTTCTCCAGGAGCCAATGGGACATGCAGTCTAC 816 

Qy 727 ATTTGACAACACCATCAAGGAACATTTCACAGAAGGCTCTAGGCCAGTTTTGTCTCATCT 786 

I M I I II I I I I I I I I II I I I I I I II II I I I I I I I I I I I II I I I I I I II I I I I I I I I I I 

Db 817 ATTTGACAACACCATCAAGGAGCATTTCCCAGAAGGCTCTAGGCCAGTTTTGTCTCATCT 876 

Qy 787 TCTACTGAAAGCAGAGTTGTGGTTCATTAACTCTGACTTTGCCTTTGATTTTGCTCGACC 846 

I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I II II 

Db 877 TCTACTGAAAGCAGAGTTGTGGTTTGTTAACTCTGATTTTGCCTTTGATTTTGCCCGGCC 936 

Qy 847 TCTGCTTCCCAACACTGTTTATGTTGGAGGCTTGATGGAAAAACCTATTAAACCAGTACC 906 

I II I II I I I I I I I I II II II I II I I I II I I II II I II I I II I II II II I II I I II I II 



Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



937 CCTGCTTCCCAACACTGTTTATATTGGAGGCTTGATGGAAAAACCTATTAAACCAGTACC 996 



907 



997 



967 



ACAAGACTTGGAGAACTTCATTGCCAAGTTTGGGGACTCTGGTTTTGTCCTTGTGACCTT 966 
I I I I I I I I I I I I I I I I I I I I I i I i I I I I I I I I I I I II I I II I II II I II I I I I 
ACAAGACTTGGACAACTTCATTGCCAACTTTGGGGATGCAGGGTTTGTCCTTGTGGCCTT 1056 



GGGCTCCATGGTGAACACCTGTCAGAATCCGGAAATCTTCAAGGAGATGAACAATGCCTT 1026 

I I I I I I I I I I I I I I II I I I I I I MM II I I I II II I II II II I I I II I 
1057 TGGCTCCATGTTGAACACCCATCAGTCCCAGGAAGTCCTCAAGAAGATGCACAATGCCTT 1116 

1027 TGCTCACCTACCCCAAGGGGTGATATGGAAGTGTCAGTGTTCTCATTGGCCCAAAGATGT 1086 

III I II M II II I II I II I II I II I I II II I I M II I I I II II I I I I II I II 

1117 TGCCCACCTCCCTCAAGGAGTGATATGGACATGTCAGAGTTCTCATTGGCCCAGAGATGT 1176 

1087 CCACCTGGCTGCAAATGTGAAAATTGTGGACTGGCTTCCTCAGAGTGACCTCCTGGCTCA 114 6 

II I I II I II II I I II II II II II II II II I I I II II II I II I II I II II II II I I 
1177 TCATTTGGCCACAAATGTGAAAATTGTGGACTGGCTTCCTCAGAGTGACCTCCTGGCTCA 1236 

1147 CCCAAGCATCCGTCTGTTTGTCACCCACGGCGGGCAGAATAGCATAATGGAGGCCATCCA 1206 
Ml I II II I II M I II II I I II II II I II I I M I Ml II I I I I M II II I I 

1237 CCCCAGCATCCGTCTTTTTGTCACTCATGGTGGGCAGAACAGCGTAATGGAGACCATCCG 1296 

1207 GCATGGTGTGCCCATGGTGGGGATCCCTCTCTTTGGAGACCAGCCTGAAAACATGGTCCG 1266 

I I I II I II I I I I I II II II I I M II II I I II I I I II M II II II I II II I 

1297 TCATGGTGTGCCCATGGTGGGATTACCAGTCAATGGAGACCAGCATGGAAACATGGTCCG 1356 

1267 AGTAGAAGCCAAAAAGTTTGGTGTTTCTATTCAGTTAAAGAAGCTCAAGGCAGAGACATT 1326 

I II I I I II II M II I II II II II II I I II I I I II II I II II II I I 

1357 AGTAGTAGCCAAAAATTATGGTGTCTCTATCCGGTTGAATCAGGTCACAGCCGACACACT 1416 

1327 GGCTCTTAAGATGAAACAAATCATGGAAGACAAGAGATACAAGTCCGCGGCAGTGGCTGC 1386 

I I II II II II II I I I M II M II I II I II I II II II I I II I I M I I II 

1417 GACACTTACAATGAAACAAGTCATAGAAGACAAGAGGTACAAGTCGGCAGTGGTGGCAGC 1476 

1387 CAGTGTCATCCTGCGCTCCCACCCGCTCAGCCCCACACAGCGGCTGGTGGGCTGGATTGA 1446 

I II II II II II I I I III M II II II II I I II II II II I I II II II I II I I I M 

1477 CAGTGTCATCCTGCACTCTCAGCCCCTGAGCCCCGCACAGCGGCTGGTGGGCTGGATCGA 1536 

1447 CCACGTCCTCCAGACAGGGGGCGCGACGCACCTCAAGCCCTATGTCTTTCAGCAGCCCTG 1506 

II I I I I II I I I I II I I II I II II I II I II I II II II II I I I I II I II II I I I I II 
CCACATCCTCCAGACTGGGGGAGCGACGCACCTCAAGCCCTATGTCTTCCAGCAGCCTTG 1596 



1537 



1566 



1507 GCATGAGCAGTACCTGTTCGACGTTTTTGTGTTTCTGCTGGGGCTCACTCTGGGGACTCT 
I II II I I II II I II I I II II I II II II I II II II II M II I II I II I M III I 

1597 GCATGAGCAGTACCTCATTGATGTCTTTGTGTTTCTGCTGGGGCTCACTCTGGGCACTAT 1656 

1567 ATGGCTTTGTGGGAAGCTGCTGGGCATGGCTGTCTGGTGGCTGCGTGGGGCCAGAAAGGT 1626 

I I II II I II I II I I I I II II II I III I I I I II I II II M II II I II I II II I 
1657 GTGGCTTTGTGGGAAGCTGCTGGGTGTGGTGGCCAGGTGGCTGCGTGGGGCCAGGAAGGT 1716 



1627 



1662 



GAAGGAGACATAAGGCCAGGTGCAGCCTTGGCGGGG 

I I II I I M II MM II II I II II M II I I 
1717 GAAGAAGACATGAGGCTAGGTGTAGCCTTGGGTGAG 1752 
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REMARK 
COMMENT 



GI: 46250395 



(human) 



Craniata ; Vertebra ta ; Euteleos tomi ; 
Catarrhini; Hominidae; Homo. 



BC068446 2759 bp mRNA linear PRI 30-JUN-2004 

Homo sapiens hypothetical protein FLJ34658, mRNA (cDNA clone 
MGC:86988 IMAGE: 5295502 ) , complete cds. 
BC068446 
BC068446.1 
MGC- 

Homo sapiens 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 

1 (bases 1 to 2759) 

Strausberg^ R. L. , Feingold, E . A. , Grouse, L.H. , Derge, J.G. , 
Klausner^R.D, , Collins, F. S . , Wagner, L., Shenmen, CM. , Schuler, G.D. , 
Altschul, S. F. , Zeeberg,B., Buetow,K.H., Schaef er, C. F. , Bhat,N.K., 
Hopkins, R. F. , Jordan, H., Moore, T., Max,S.I., Wang, J., Hsieh,F., 
Diatchenko, L. , Marusina,K., Farmer, A. A., Rubin, G.M., Hong,L., 
Stapleton,M. , Soares,M.B., Bonaldo,M. F. , Casavant, T . L. , 
Scheetz,T.E. , Browns tein,M. J. , Usdin,T. B. , Toshiyuki, S . , 
Carninci,P., Prange,C., Raha,S.S., Loquellano,N. A. , Peters, G. J., 
Abramson, R. D. , Mullahy, S . J. , Bosak,S.A., McEwan,P.J., 
McKernan, K. J. , Malek, J. A. , Gunaratne, P.H. , Richards, S. , 
Worley,K.C., Hale,S., Garcia, A.M., Gay,L.J., Hulyk,S.W., 
Villalon, D. K. , Muzny,D.M., Sodergren, E . J. , Lu,X., Gibbs,R.A., 
Fahey,J., Helton, E., Ketteman,M., Madan,A., Rodrigues , S . , 
Sanchez, A., Whiting, M., Madan,A., Young, A. C, Shevchenko, Y. , 
Bouf f ard, G. G. , Blakesley, R.W. , Touchman, J.W. , Green, E. D. , 
Dickson,M.C. , Rodriguez, A, C. , Grimwood,J., Schmutz,J., Myers, R.M., 
Butterfield, Y.S. , Krzywinski, M. I . , Skalska,U. , Smailus,D.E. , 
Schnerch,A., Schein,J.E., Jones, S.J. and Marra,M.A. 
Generation and initial analysis of more than 15,000 full-length 
human and mouse cDNA sequences 

Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) 
12477932 

2 (bases 1 to 2759) 
Strausberg, R. 
Direct Submission 

Submitted ( 02-APR-2004 ) National Institutes of Health, Mammalian 
Gene Collection (MGC) , Cancer Genomics Office, National Cancer 
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, 

USA 

NIH-MGC Project URL: http://mgc.nci.nih.gov 
Contact: MGC help desk 
Email : cgapbs-rSmail . nih . gov 

Tissue Procurement: Miklos Palkovits, M.D., Ph.D. 

cDNA Library Preparation: Michael J. Brownstein (NHGRI) & Shiraki 
Toshiyuki and Piero Carninci (RIKEN) 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Sequencing Group at the Stanford Human Genome 

Center, Stanford University School of Medicine, Stanford, CA 94305 

Web site: http://www-shgc.stanford.edu 

Contact: (Dickson, Mark) mcd@paxil.stanford.edu 

Dickson, M., Schmutz, J., Grimwood, J., Rodriguez, A., and Myers, 
R. M. 



Clone distribution: MGC clone distribution information can be found 
through the I.M.A.G.E. Consortium/ LLNL at: http://image.llnl.gov 
Series: IRAK Plate: 168 Row: c Column: 18 



FEATURES 

source 



gene 



CDS 



This clone was selected for full length sequencing because it 
passed the following selection criteria: matched. itiRNA gi: 24432082. 

Location/Qualifiers 

1. .2759 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon:9606" 
/clone="MGC: 86988 IMAGE: 5295502" 
/tissue_type="Testis" 
/clone_lib="NIH_MGC_97 " 
/lab_host="DH10B" 
/note="Vector : pBluescript" 
1. .2759 

/gene="FLJ34658" 
/db_xref="LocusID: 133688" 
61. .1632 
/gene="FLJ34658" 
/codon_start=l 

/product="hypothetical protein FLJ34658" 
/protein_id="AAH68446. 1" 
/db_xref="GI : 46250396" 
/db_xref="LocusID: 133688" 

/translation="MVGQRVLLLVAFLLSGVLLSEAAKILTISTLGGSHYLLLDRVSQ 

ILQEHGHNVTMLHQSGKFLIPDIKEEEKSYQVIRWFSPEDHQKRIKKHFDSYIETALD 

GRKESEALVKIiMEIFGTQCSYLLSRKDIMDSLKNENYDLVFVEAFDFCSFLIAEKLVK 

PFVAILPTTFGSLDFGLPSPLSYVPVFPSLLTDHMDFWGRVKNFLMFFSFSRSQWDMQ 

STFDNTIKEHFPEGSRPVLSHLLLKAELWFVNSDFAFDFARPLLPNTVYIGGLMEKPI 

KPVPQDLDNFIANFGDAGFVLVAFGSMLNTHQSQEVLKKMHNAFAHLPQGVIWTCQSS 

HWPRDVHIAT^^^^<IVDWLPQSDLLAHP S I RLFVTHGGQNSVMEAI RHGVPMVGL^ 

DQHGNMVRWAKN YGVS I RLNQVTADTLTLTMKQVI EDKRYKS AWAAS VI LHSQPLS 

PAQRLVGWIDHILQTGGATHLKPYAFQQPWHEQYLIDVFVFLLGLTLGTMWLCGKLLG 

WARWLRGARKVKKT " 



ORIGIN 



Query Match 55.2%; 
Best Local Similarity 85.9%; 
Matches 1421; Conservative 



Score 1280.6; 
Pred. No. 0; 
0; Mismatches 



DB 9; Length 2759; 



234; Indels 



0; Gaps 



0; 



Qy 

Db 



8 CTTAGCCGGGCGCAGGGCGCGCAGCCCAGGCTGAGATCCGCGGCTTCCGTAGAAGTGAGC 67 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I 
1 CTTAGCCCGGCGCAGGGCGCGCAGCCCAGGCTGAGATCCGCTGCTTCTGTGGAAGTGAGC 60 



Qy 

Db 



68 ATGGCTGGGCAGCGAGTGCTTCTTCTAGTGGGCTTCCTTCTCCCTGGGGTCCTGCTCTCA 127 
I I I I M I II I I I I I I II I I I I I I I II II I I I I I I M I I M I I II M I I I I I I I I 

61 ATGGTTGGGCAGCGGGTGCTGCTTCTAGTGGCCTTCCTTCTTTCTGGGGTCCTGCTCTCA 120 



Qy 

Db 

Qy 

Db 



128 GAGGCTGCCAAAATCCTGACAATATCTACAGTAGGTGGAAGCCATTATCTACTGATGGAC 187 

I I I I I I I I II I I I M I I I I I I II I I I I I I I I I I I I I I I I I II I I I MINI I I I M 

121 GAGGCTGCCAAAATCCTGACT^ATATCTACACTGGGTGGAAGCCATTACCTACTGTTGGAC 180 

188 CGGGTTTCTCAGATTCTTCAAGATCACGGTCATAATGTCACCATGCTTAACCACAAAAGA 247 

II I M I I I I I I M I M M I I I I II I I I I U M I I I II I I II I I I II I II 

181 CGGGTGTCTCAGATTCTTCAAGAGCATGGTCATAATGTGACTATGCTTCATCAGAGTGGA 240 



Qy 248 
Db 241 



307 
300 



308 GCACCTGAAGATCATCAAAGAGAATTTAA7WVGAGTTTTGATTTCTTTCTGGAAG7\AACT 367 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II 

301 TCACCTGAAGATCATCAAAAAAGAATTAAGAAGCATTTTGATAGCTACATAGAAACAGCA 360 

368 TTAGGTGGCAGAGGAAAATTTGAAAACTTATTAAATGTTCTAGAATACTTGGCGTTGCAG 427 

I I I I I I I I i I I I I I .1 Ml II I I I I I I I II I I I I 11 

361 TTGGATGGCAGAAAAG/^TCTGAAGCCCTTGTAAAGCTAATGGAAATATTTGGGACTCAA 420 

428 TGCAGTCATTTTTTAAATAGAAAGGATATCATGGATTCCTTAAAGAATGAGAACTTCGAC 487 

II III I I II III I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I II 

421 TGTAGTTATTTGCTAAGCAGAAAGGATATAATGGATTCCTTAAAGAATGAGAACTATGAT 480 

488 ATGGTGATAGTTGAAACTTTTGACTACTGTCCTTTCCTGATTGCTGAGAAGCTTGGGT^G 547 

II II I I I I I I I I II I II I I I I I I I I I I I II I I II I I I I I I I I I I I I III 

481 CTGGTATTTGTTGAAGCATTTGATTTCTGTTCTTTCCTGATTGCTGAGAAGCTTGTGAAA 540 

548 CCATTTGTGGCCATTCTTTCCACTTCATTCGGCTCTTTGGAATTTGGGCTACCAATCCCC 607 

I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
541 CCATTTGTGGCCATTCTTCCCACCACATTCGGCTCTTTGGATTTTGGGCTACCAAGCCCC 600 

608 TTGTCTTATGTTCCAGTATTCCGTTCCTTGCTGACTGATCACATGGACTTCTGGGGCCGA 667 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
601 TTGTCTTATGTTCCAGTATTCCCTTCCTTGCTGACTGATCACATGGACTTCTGGGGCCGA 660 

668 GTGAAGAATTTTCTGATGTTCTTTAGTTTCTGCAGGAGGCAACAGCACATGCAGTCTACA 727 

I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I 
661 GTGAAGAATTTTCTGATGTTCTTTAGTTTCTCCAGGAGCCAATGGGACATGCAGTCTACA 720 

728 TTTGACAACACCATCAAGGAACATTTCACAGAAGGCTCTAGGCCAGTTTTGTCTCATCTT 787 

I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
721 TTTGACAACACCATCAAGGAGCATTTCCCAGAAGGCTCTAGGCCAGTTTTGTCTCATCTT 780 

788 CTACTGAAAGCAGAGTTGTGGTTCATTAACTCTGACTTTGCCTTTGATTTTGCTCGACCT 847 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II 
781 CTACTGAAAGCAGAGTTGTGGTTTGTTAACTCTGATTTTGCCTTTGATTTTGCCCGGCCC 840 

848 CTGCTTCCCAACACTGTTTATGTTGGAGGCTTGATGGAAAAACCTATTAAACCAGTACCA 907 

I I I I I I I II II M I I I II I M I II I II I I I I I I I I I I I M I I M II II I II I II I I II I 

841 CTGCTTCCCAACACTGTTTATATTGGAGGCTTGATGGAAAAACCTATTAAACCAGTACCA 900 

908 CAAGACTTGGAGAACTTCATTGCCAAGTTTGGGGACTCTGGTTTTGTCCTTGTGACCTTG 967 

II II II I I II I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I MM 

901 CAAGACTTGGACAACTTCATTGCCAACTTTGGGGATGCAGGGTTTGTCCTTGTGGCCTTT 960 

968 GGCTCCATGGTGAACACCTGTCAGAATCCGGAAATCTTCAAGGAGATGAACAATGCCTTT 1027 

I I I I II I II I II I I I II MM I I II I II I I II I II II I II I I II I I I M 

961 GGCTCCATGTTGAACACCCATCAGTCCCAGGAAGTCCTCAAGAAGATGCACAATGCCTTT 102 0 

1028 GCTCACCTACCCCAAGGGGTGATATGGAAGTGTCAGTGTTCTCATTGGCCCAAAGATGTC 1087 

II I II II II II I II I II II II I II II II II I M II II II II II II I II I II 

1021 GCCCACCTCCCTCAAGGAGTGATATGGACATGTCAGAGTTCTCATTGGCCCAGAGATGTT 1080 

1088 CACCTGGCTGCAAATGTGAAAATTGTGGACTGGCTTCCTCAGAGTGACCTCCTGGCTCAC 1147 
M MM I II II II M II II I I II I II I M M II I M I t I I I II M I II II II I II I 

1081 CATTTGGCCACAAATGTGAAAATTGTGGACTGGCTTCCTCAGAGTGACCTCCTGGCTCAC 1140 



Qy 1148 CCAAGCATCCGTCTGTTTGTCACCCACGGCGGGCAGAATAGCATAATGGAGGCCATCCAG 1207 

II lllllllllll llllllll II II llllllll. Ill IIIIIIIIIIMIII 

Db 1141 CCCAGCATCCGTCTTTTTGTCACTCATGGTGGGCAGAACAGCGTAATGGAGGCCATCCGT 1200 

Qy 1208 CATGGTGTGCCCATGGTGGGGATCCCTCTCTTTGGAGACCAGCCTGAAAACATGGTCCGA 1267 

I II I II I II II II II I II II I II II II I M II I I I I II I I I I I I I I I I I I I 
Db 1201 CATGGTGTGCCCATGGTGGGATTACCAGTCAATGGAGACCAGCATGGAAACATGGTCCGA 1260 

Qy 1268 GTAGAAGCC7VAAAAGTTTGGTGTTTCTATTCAGTTA7VAGAAGCTCAAGGCAGAGACATTG 1327 

I I II II I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1261 GTAGTAGCCAAAAATTATGGTGTCTCTATCCGGTTGAATCAGGTCACAGCCGACACACTG 1320 

Qy 1328 GCTCTTAAGATGAAACAAATCATGGAAGACAAGAGATACAAGTCCGCGGCAGTGGCTGCC 1387 

I II II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 II I Mill III 
Db 1321 ACACTTACAATGAAACAAGTCATAGAAGACAAGAGGTACAAGTCGGCAGTGGTGGCAGCC 1380 

Qy 1388 AGTGTCATCCTGCGCTCCCACCCGCTCAGCCCCACACAGCGGCTGGTGGGCTGGATTGAC 1447 

I I I I I I I I I I I I I III II II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 
Db 1381 AGTGTCATCCTGCACTCTCAGCCCCTGAGCCCCGCACAGCGGCTGGTGGGCTGGATCGAC 1440 

Qy 1448 CACGTCCTCCAGACAGGGGGCGCGACGCACCTCAAGCCCTATGTCTTTCAGCAGCCCTGG 1507 

III I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I III I I I I I I I I III 

Db 1441 CACATCCTCCAGACTGGGGGAGCGACGCACCTCAAGCCCTATGCCTTCCAGCAGCCTTGG 1500 

Qy 1508 CATGAGCAGTACCTGTTCGACGTTTTTGTGTTTCTGCTGGGGCTCACTCTGGGGACTCTA 1567 

I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I M I I I I I I M I I III I 
Db 1501 CATGAGCAGTACCTCATTGATGTCTTTGTGTTTCTGCTGGGGCTCACTCTGGGCACTATG 1560 

Qy 1568 TGGCTTTGTGGGAAGCTGCTGGGCATGGCTGTCTGGTGGCTGCGTGGGGCCAGAAAGGTG 1627 

I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I 11 II I I 
Db 1561 TGGCTTTGTGGGAAGCTGCTGGGTGTGGTGGCCAGGTGGCTGCGTGGGGCCAGGAAGGTG 1620 

Qy 1628 AAGGAGACATAAGGCCAGGTGCAGCCTTGGCGGGG 1662 

III I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1621 AAGAAGACATGAGGCTAGGTGTAGCCTTGGGTGAG 1655 
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AX327327 2086 bp DNA linear PAT 07-JAN-2002 

Sequence 12 from Patent WO0179468. 

AX327327 

AX327327.1 GI:18097873 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 

Policky, J. L. , Hafalia,A. , Burford,N., Ring^H.Z., Lai, P., 

Tribouley^C.M. , Yao^M.G., Yue,H., Tang,Y.T., Patterson, C. , Das,D., 

Sanjanwala,M. S . , Gandhi, A. R., Reddy,R., Khan, F. A., Baughn,M-R., 

Ramkumar,J., Griffin, J. A. and Au- Young, J. 

Drug metabolizing enzymes 

Patent: WO 0179468-A 12 25-OCT-2001; 

Incyte Genomics, Inc. (US) 

Location/Qualifiers 



source 1. .2086 

/organisin="Homo sapiens" 
/mol_type="unassigned DNA" 
/cib_xref="taxon:9606" 
/note="Incyte ID No: 2516747CB1" 

ORIGIN 

Query Match 55.2%; Score 1280; DB 6; Length 2086; 

Best Local Similarity 85.8%; Pred. No. 0; 

Matches 1421; Conservative 0; Mismatches 235; Ihdels 0; Gaps 0; 

Qy 7 CCTTAGCCGGGCGCAGGGCGCGCAGCCCAGGCTGAGATCCGCGGCTTCCGTAGAAGTGAG 66 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I U I I I I I I I I I Mill II I I I I I I I I 
Db 59 CCTTAGCCCGGCGCAGGGCGCGCAGCCCAGGCTGAGATCCGCTGCTTCTGTGGAAGTGAG 118 

Qy 67 CATGGCTGGGCAGCGAGTGCTTCTTCTAGTGGGCTTCCTTCTCCCTGGGGTCCTGCTCTC 126 

I I I I I I I I I I I I M I I I I I I I I I I I I I I I M M I I I I I I I i I I I I I I I I I I I I I 
Db 119 CATGGTTGGGCAGCGGGTGCTGCTTCTAGTGGCCTTCCTTCTTTCTGGGGTCCTGCTCTC 178 

Qy 127 AGAGGCTGCCAAAATCCTGACAATATCTACAGTAGGTGGAAGCCATTATCTACTGATGGA 186 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 179 AGAGGCTGCCAAAATCCTGACAATATCTACACTGGGTGGAAGCCATTACCTACTGTTGGA 238 

Qy 187 CCGGGTTTCTCAGATTCTTCAAGATCACGGTCATAATGTCACCATGCTTAACCACAAAAG 246 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II II MINI I II I I 
Db 239 CCGGGTGTCTCAGATTCTTCAAGAGCATGGTCATAATGTGACTATGCTTCATCAGAGTGG 298 

Qy 247 AGGTCCTTTTATGCCAGATTTTAAAAAGGAAGAAAAATCATATCAAGTTATCAGTTGGCT 306 

I III II I I I I I I I I I I I I I I I I II I I III I I I I I I I I I I I I I I III I 

Db 299 AT^GTTTTTGATCCCAGATATTAAAGAGGAGGAAAAATCATACCAAGTTATCAGGTGGTT 358 

Qy 307 TGCACCTGAAGATCATCAAAGAGAATTTAAAAAGAGTTTTGATTTCTTTCTGGAAGAAAC 366 

I I I I I M I I I I I I I II I M I I II I I I I I I I I I I I I II I II I II 
Db 359 TTCACCTGAAGATCATCAAAAAAGAATTAAGAAGCATTTTGATAGCTACATAGAAACAGC 418 

Qy 367 TTTAGGTGGCAGAGGAAAATTTGAAAACTTATTAAATGTTCTAGAATACTTGGCGTTGCA 426 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II 

Db 419 ATTGGATGGCAGAAAAGAATCTGAAGCCCTTGTAAAGCTAATGGAAATATTTGGGACTCA 478 

Qy 427 GTGCAGTCATTTTTTAAATAGAAAGGATATCATGGATTCCTTAAAGAATGAGAACTTCGA 48 6 

II III II I I III I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I II 

Db . 479 ATGTAGTTATTTGCTAAGCAGAAAGGATATAATGGATTCCTTAAAGAATGAGAACTATGA 538 

Qy 487 CATGGTGATAGTTGAAACTTTTGACTACTGTCCTTTCCTGATTGCTGAGAAGCTTGGGAA 546 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I III 
Db. 539 TCTGGTATTTGTTGAAGCATTTGATTTCTGTTCTTTCCXGATTGCTGAGAAGCTTGTGAA 598 

Qy 547 GCCATTTGTGGCCATTCTTTCCACTTCATTCGGCTCTTTGGAATTTGGGCTACCAATCCC 606 

I I I I I I II I I I I I I I I I I I I I I M M M I I I M I I I I I I I I I I I II I I I I I III 

Db 599 ACCATTTGTGGCCATTCTTCCCACCACATTCGGCTCTTTGGATTTTGGGCTACCAAGCCC 658 

Qy 607 CTTGTCTTATGTTCCAGTATTCCGTTCCTTGCTGACTGATCACATGGACTTCTGGGGCCG 666 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I II I I 
Db 659 CTTGTCTTATGTTCCAGTATTCCCTTCCTTGCTGACTGATCACATGGACTTCTGGGGCCG 718 



Qy 



667 



AGTGAAGAATTTTCTGATGTTCTTTAGTTTCTGCAGGAGGCAACAGCACATGCAGTCTAC 
I I II I II II I I I II I I I I I I I I I I II I I II I I I I I II I III I I I I I I II I I I I I I 



726 



Db 

Qy 

Db 

Qy 

Db 
Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



719 AGTGAAGAATTTTCTGATGTTCTTTAGTTTCTCCAGGAGCCAATGGGACATGCAGTCTAC 778 



727 



779 



787 



ATTTGACAACACCATCAAGGAACATTTCACAGAAGGCTCTAGGCCAGTTTTGTCTCATCT 
I I i I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I t I 

ATTTGACAACACCATCAAGGAGCATTTCCCAGAAGGCTCTAGGCCAGTTTTGTCTCATCT 



786 



838 



846 



TCTACTGAAAGCAGAGTTGTGGTTCATTAACTCTGACTTTGCCTTTGATTTTGCTCGACC 
I I I I I I I I I I I I 1 I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II 
839 TCTACTGAAAGCAGAGTTGTGGTTTGTTAACTCTGATTTTGCCTTTGATTTTGCCCGGCC 898 



847 



899 



907 



959 



967 



TCTGCTTCCCAACACTGTTTATGTTGGAGGCTTGATGGAAAAACCTATTAAACCAGTACC 
I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I M II I I 
CCTGCTTCCCAACACTGTTTATATTGGAGGCTTGATGGAAAAACCTATTAAACCAGTACC 



906 



958 



966 



ACAAGACTTGGAGAACTTCATTGCCAAGTTTGGGGACTCTGGTTTTGTCCTTGTGACCTT 
I I I I I I I I I I I I I I I I I I I I I I I I I I M II I I I I I II I I I M I I I I I I I I II I 
ACAAGACTTGGACAACTTCATTGCCAACTTTGGGGATGCAGGGTTTGTCCTTGTGGCCTT 1018 



GGGCTCCATGGTGAACACCTGTCAGAATCCGGAAATCTTCAAGGAGATGAACT^TGCCTT 1026 

I I II I I II I I I I I I I I I II I I I I I I I M I I I M I II I I I M I I I I I I I 

1019 TGGCTCCATGTTGAACACCCATCAGTCCCAGGAAGTCCTCAAGAAGATGCACAATGCCTT 1078 

1027 TGCTCACCTACCCCAAGGGGTGATATGGAAGTGTCAGTGTTCTCATTGGCCCAAAGATGT 1086 

III I I I I I II Mill I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1079 TGCCCACCTCCCTCAAGGAGTGATATGGACATGTCAGAGTTCTCATTGGCCCAGAGATGT 1138 

1087 CCACCTGGCTGCAAATGTGAAAATTGTGGACTGGCTTCCTCAGAGTGACCTCCTGGCTCA 1146 

II I I I I I I I I I I II M I I I II I I I I I I I I I I I M I I I II I I I M I I I I M I II I 

1139 TCATTTGGCCACAAATGTGAAAATTGTGGACTGGCTTCCTCGGAGTGACCTCCTGGCTCA 1198 

1147 CCCAAGCATCCGTCTGTTTGTCACCCACGGCGGGCAGAATAGCATAATGGAGGCCATCCA 1206 
II I I I I I I I I I I I I I I I II I I II II I I I I I I I I III I I I I I I I I I I I I I I I 
CCCCAGCATCCGTCTTTTTGTCACTCATGGTGGGCAGAACAGCGTAATGGAGGCCATCCG 1258 



1199 



1266 



1207 GCATGGTGTGCCCATGGTGGGGATCCCTCTCTTTGGAGACCAGCCTGAAAACATGGTCCG 
I I I I I I I I I I I I I I I I II II I II II I I I I I I I I I I I II I I I I I I I I I I II 
TCATGGTGTGCCCATGGTGGGATTACCAGTCAATGGAGACCAGCATGGAAACATGGTCCG 1318 



1259 



1267 AGTAGAAGCCAAAAAGTTTGGTGTTTCTATTCAGTTAAAGTVAGCTCAAGGCAGAGACATT 1326 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 11111111 

1319 AGTAGTAGCCAAAAATTATGGTGTCTCTATCCGGTTGAATCAGGTCACAGCCGACACACT 1378 

1327 GGCTCTTAAGATGAAACAAATCATGGAAGACAAGAGATACAAGTCCGCGGCAGTGGCTGC 1386 

I I I I I I I I I I I II I I I I II I II I II I I I I I I I I I I I I I II I I I I I I II 

1379 GACACTTACAATGAAACAAGTCATAGAAGACAAGAGGTACAAGTCGGCAGTGGTGGCAGC 1438 

1387 CAGTGTCATCCTGCGCTCCCACCCGCTCAGCCCCACACAGCGGCTGGTGGGCTGGATTGA 1446 

I II I I I I I I I I I I I III II II II I I I I I I I I I I I I I I I I I I I II I I I I I I I II 

1439 CAGTGTCATCCTGCACTCTCAGCCCCTGAGCCCCGCACAGCGGCTGGTGGGCTGGATCGA 1498 

1447 CCACGTCCTCCAGACAGGGGGCGCGACGCACCTCAAGCCCTATGTCTTTCAGCAGCCCTG 1506 

I I I I I I I I I I I I I I Mill II I II I I I I II I I I II II I I I I III II II I I II II 

1499 CCACATCCTCCAGACTGGGGGAGCGACGCACCTCAAGCCCTATGCCTTCCAGCAGCCTTG 1558 

1507 GCATGAGCAGTACCTGTTCGACGTTTTTGTGTTTCTGCTGGGGCTCACTCTGGGGACTCT 1566 

I II II I I II II II II I II II II I I II II M II I I I I II I I II II II II I III I 

1559 GCATGAGCAGTACCTCATTGATGTCTTTGTGTTTCTGCTGGGGCTCACTCTGGGCACTAT 1618 



Qy 

Db 



1567 ATGGCTTTGTGGGAAGCTGCTGGGCATGGCTGTCTGGTGGCTGCGTGGGGCCAGAAAGGT 1626 

I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I II I I I I I I I I I I I I I II I I I 
1619 GTGGCTTTGTGGGAAGCTGCTGGGTGTGGTGGCCAGGTGGCTGCGTGGGGCCAGGAAGGT 1678 



Qy 1627 GAAGGAGACATAAGGCCAGGTGCAGCCTTGGCGGGG 1662 

I II I I II II I I I II I I I I I II II I I I I I I 
Db 1679 GAAGAAGACATGAGGCTAGGTGTAGCCTTGGGTGAG 1714 
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ORIGIN 



AX359923 1572 bp DNA linear PAT 13-FEB-2002 

Sequence 3 from Patent WO0202774. 

AX359923 

AX359923 .1 GI : 18675565 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebra ta; Euteleostomi; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo, 

1 

Leiby^K.R., Cook, W.J. and Spaltmann^F. 

32626, a human udp-glycosyltransf erase and uses thereof 
Patent: WO 0202774-A 3 lO-JAN-2002; 
Millennium Pharmaceuticals, Inc. (US) 

Location/Qualifiers 

1. .1572 

/organism="Homo sapiens" 
/mol_type=" Unas signed DNA" 
/db xref="taxon:9606" 



Query Match 52.2%; 
Best Local Similarity 85.7%; 
Matches 1347; Conservative 



Score 1212; DB 6; Length 1572; 
Pred. No. 1.8e-310; 
0; Mismatches 225; Indels 0; 



Gaps 



0; 



Qy 

Db 



68 ATGGCTGGGCAGCGAGTGCTTCTTCTAGTGGGCTTCCTTCTCCCTGGGGTCCTGCTCTCA 127 
I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I II I I I I I I I I 

1 ATGGTTGGGCAGCGGGTGCTGCTTCTAGTGGCCTTCCTTCTTTCTGGGGTCCTGCTCTCA 60 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



128 GAGGCTGCCAAAATCCTGACAATATCTACAGTAGGTGGAAGCCATTATCTACTGATGGAC 187 
I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I MINI I I I I I 
61 GAGGCTGCCAAAATCCTGACAATATCTACACTGGGTGGAAGCCATTACCTACTGTTGGAC 120 

188 CGGGTTXCTCAGATTCTTCAAGATCACGGTCATAATGTCACCATGCTTAACCACAAAAGA 247 
II II I I I I I I I I I I I I I I I I I I II I I I I I I I I I M II I I I I M I II I II 

121 CGGGTGTCTCAGATTCTTCAAGAGCATGGTCATAATGTGACTATGCTTCATCAGAGTGGA 180 

248 GGTCCTTTTATGCCAGATTTTAAAAAGGAAGAAAAATCATATCAAGTTATCAGTTGGCTT 307 

III II I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I III II 
181 AAGTTTTTGATCCCAGATATTAAAGAGGAGGAAAAATCATACCAAGTTATCAGGTGGTTT 240 

308 GCACCTGAAGATCATCAAAGAGAATTTAAATVAGAGTTTTGATTTCTTTCTGGAAGAAACT 367 

I I I I M I II I I I I I II I I I I I I I I M I I I I M I I II INI II 

241 TCACCTGAAGATCATCAAAAAAGAATTAAGAAGCATTTTGATAGCTACATAGAAACAGCA 300 



Qy 368 TTAGGTGGCAGAGGAAAATTTGAAAACTTATTAAATGTTCTAGAATACTTGGCGTTGCAG 427 

I I I I I I I I I I I I I I I I i I II I I I I I I I I I II I h II 

Db 301 TTGGATGGCAGAAAAGAATCTGAAGCCCTTGTAAAGCTAATGGAAATATTTGGGACTCAA 360 

Qy 428 TGCAGTCATTTTTTAAATAGAAAGGATATCATGGATTCCTTAAAGAATGAGAACTTCGAC 487 

II III I II I III I II I I I I II I I I I I I I I I I I I I I I I I I I I I I M I I I II 

Db 361 TGTAGTTATTTGCTAAGCAGAAAGGATATAATGGATTCCTTAAAGAATGAGAACTATGAT 420 

Qy 488 ATGGTGATAGTTGAAACTTTTGACTACTGTCCTTTCCTGATTGCTGAGAAGCTTGGGAAG 547 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 
Db 421 CTGGTATTTGTTGAAGCATTTGATTTCTGTTCTTTCCTGATTGCTGAGAAGCTTGTGAAA 480 

Qy 548 CCATTTGTGGCCATTCTTTCCACTTCATTCGGCTCTTTGGAATTTGGGCTACCAATCCCC 607 

I I I I I I I I I I I I I I I I II I I II I I II I I I I I II I I I I I II I II I I I II I II I I I I 
Db 481 CCATTTGTGGCCATTCTTCCCACCACATTCGGCTCTTTGGATTTTGGGCTACCAAGCCCC 540 

Qy 608 TTGTCTTATGTTCCAGTATTCCGTTCCTTGCTGACTGATCACATGGACTTCTGGGGCCGA 667 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 541 TTGTCTTATGTTCCAGTATTCCCTTCCTTGCTGACTGATCACATGGACTTCTGGGGCCGA 600 

Qy 668 GTGAAGAATTTTCTGATGTTCTTTAGTTTCTGCAGGAGGCAACAGCACATGCAGTCTACA 727 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II III I I I I I I I I I I I I I I I 
Db 601 GTGAAGAATTTTCTGATGTTCTTTAGTTTCTCCAGGAGCCAATGGGACATGCAGTCTACA 660 

Qy 728 TTTGACAACACCATCAAGGAACATTTCACAGAAGGCTCTAGGCCAGTTTTGTCTCATCTT 787 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 661 TTTGACAACACCATCAAGGAGCATTTCCCAGAAGGCTCTAGGCCAGTTTTGTCTCATCTT 720 

Qy 788 CTACTGAAAGCAGAGTTGTGGTTCATTAACTCTGACTTTGCCTTTGATTTTGCTCGACCT 847 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II 
Db 721 CTACTGAAAGCAGAGTTGTGGTTTGTTAACTCTGATTTTGCCTTTGATTTTGCCCGGCCC 780 

Qy 848 CTGCTTCCCAACACTGTTTATGTTGGAGGCTTGATGGAAAAACCTATTAAACCAGTACCA 907 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 781 CTGCTTCCCAACACTGTTTATATTGGAGGCTTGATGGAAAAACCTATTAAACCAGTACCA 840 

Qy 908 CAAGACTTGGAGAACTTCATTGCCAAGTTTGGGGACTCTGGTTTTGTCCTTGTGACCTTG 967 

I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 841 CAAGACTTGGACAACTTCATTGCCAACTTTGGGGATGCAGGGTTTGTCCTTGTGGCCTTT 900 

Qy 968 GGCTCCATGGTGAACACCTGTCAGAATCCGGAAATCTTCAAGGAGATGAACAATGCCTTT 1027 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 

Db 901 GGCTCCATGTTGAACACCCATCAGTCCCAGGAAGTCCTCAAGAAGATGCACAATGCCTTT 960 

Qy 1028 GCTCACCTACCCCAAGGGGTGATATGGAAGTGTCAGTGTTCTCATTGGCCCAAAGATGTC 1087 

II I I I I I II III lU^^A. I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 961 GCCCACCTCCCTCAAGGAGTGATATGGACATGTCAGAGTTCTCATTGGCCCAGAGATGTT 1020 

Qy 1088 CACCTGGCTGCAAATGTGAAAATTGTGGACTGGCTTCCTCAGAGTGACCTCCTGGCTCAC 1147 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1021 CATTTGGCCACAAATGTGAAAATTGTGGACTGGCTTCCTCAGAGTGACCTCCTGGCTCAC 1080 

Qy 1148 CCAAGCATCCGTCTGTTTGTCACCCACGGCGGGCAGAATAGCATAATGGAGGCCATCCAG 1207 

II I I I I I I I I I I I I I I I I I I I II II I I I I I I I I III I I I I I I I I I I I I I I I 
Db 1081 CCCAGCATCCGTCTTTTTGTCACTCATGGTGGGCAGAACAGCGTAATGGAGGCCATCCGT 1140 

Qy 1208 CATGGTGTGCCCATGGTGGGGATCCCTCTCTTTGGAGACCAGCCTGAAAACATGGTCCGA 1267 



Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 I II II 1 1 1 1 1 II 1 1 II II 1 1 1 1 1 1 1 i 1 1 1 II 

1141 CATGGTGTGCCCATGGTGGGATTACCAGTCAATGGAGACCAGCATGG/^CATGGTCCGA 1200 



1268 



1327 



GTAGAAGCCAAAAAGTTTGGTGTTTCTATTCAGTTAAAGAAGCTCAAGGCAGAGACATTG 

II II II II I II I I I II I II I I II I I I I 11 11 I I I I I I I I I I I I I I 
1201 GTAGTAGCCAAAAATTATGGTGTCTCTATCCGGTTGAATCAGGTCACAGCCGACACACTG 1260 

1328 GCTCTTAAGATGAAACAAATCATGGAAGACAAGAGATACAAGTCCGCGGCAGTGGCTGCC 1387 

I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I II II I -I I I i I III 
12 6 1 ACACTTACAATGAAACAAGTCATAGAAGACAAGAGGTACAAGTCGGCAGTGGTGGCAGCC 1320 

1388 AGTGTCATCCTGCGCTCCCACCCGCTCAGCCCCACACAGCGGCTGGTGGGCTGGATTGAC 1447 

I I I I I I I I I I I I I III II II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 
1321 AGTGTCATCCTGCACTCTCAGCCCCTGAGCCCCGCACAGCGGCTGGTGGGCTGGATCGAC 1380 

1448 CACGTCCTCCAGACAGGGGGCGCGACGCACCTCAAGCCCTATGTCTTTCAGCAGCCCTGG 1507 

Ml I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I III 
1381 CACATCCTCCAGACTGGGGGAGCGACGCACCTCAAGCCCTATGCCTTCCAGCAGCCTTGG 1440 

1508 CATGAGCAGTACCTGTTCGACGTTTTTGTGTTTCTGCTGGGGCTCACTCTGGGGACTCTA 1567 

I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I 
1441 CATGAGCAGTACCTCATTGATGTCTTTGTGTTTCTGCTGGGGCTCACTCTGGGCACTATG 1500 

1568 TGGCTTTGTGGGAAGCTGCTGGGCATGGCTGTCTGGTGGCTGCGTGGGGCCAGAAAGGTG 1627 

I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1501 TGGCTTTGTGGGAAGCTGCTGGGTGTGGTGGCCAGGTGGCTGCGTGGGGCCAGGAAGGTG 1560 

1628 AAGGAGACATAA 1639 

III I I I I I I I 
1561 AAGAAGACATGA 1572 
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7VR541778 2426 bp DNA linear PAT 08-OCT-2004 

Sequence 26 from patent US 6743619. 

AR541778 

AR541778 .1 GI : 53933858 

Unknown. 

Unknown. 

Unclassified. 

1 (bases 1 to 2426) 

Tang^Y.T.^ Zhou, P., Goodrich, R., Liu,C., Asundi,V., Ren,F., 
Zhang, J., Zhao, Q. A., Yang,Y., Xue,A.J., Wehrman,T., Wang, J,. -R. , 
Wang,D. and Drmanac, R.T. . 

Nucleic acids and polypeptides 
Patent: US 6743619-A 26 Ol-JUN-2004; 

Location/Qualifiers 

1. .2426 

/organism= "unknown " 
/mol type=" genomic DNA" 



ORIGIN 



Query Match 49.5%; Score 1147.8; DB 6; Length 2426; 

Best Local Similarity 85.2%; Pred. No. 2.2e-293; 

Matches 1281; Conservative 0; Mismatches 222; Indels 0; Gaps 



0; 



Qy 160 AGGTGG7VAGCCATTATCTACTGATGGACCGGGTTTCTCAGATTCTTCAAGATCACGGTCA 219 

lllllllllilllll llllll Mllllllll I I I I I I I I I M I I I I I I II Mill 
Db 617 AGGTGGAAGCCATTACCTACTGTTGGACCGGGTGTCTCAGATTCTTCAAGAGCATGGTCA 676 

Qy 220 TAATGTCACCATGCTTAACCACAAAAGAGGTCCTTTTATGCCAGATTTTAAAAAGGAAGA 279 

llllll II llllll I II 1 II III II llllll I I I I I I II I II 

Db 677 TAATGTGACTATGCTTCATCAGAGTGGAAAGTTTTTGATCCCAGATATTAAAGAGGAGGA 736 

Qy 280 AAAATCATATCAAGTTATCAGTTGGCTTGCACCTGAAGATCATCAAAGAGAATTTAAAAA 339 

I I I I I I M I I I I I I I I i I II III II I I I I I I I II I I I II I I II I I II I I II 
Db 737 AAAATCATACCAAGTTATCAGGTGGTTTTCACCTGAAGATCATCAAAAAAGAATTAAGAA 796 

Qy 340 GAGTTTTGATTTCTTTCTGGAAGAAACTTTAGGTGGCAGAGGAAAATTTGAAAACTTATT 399 

I I II I I I I M I I I I I I I I I I II I I I I I I I I I I I I II I 
Db 797 GCATTTTGATAGCTACATAGAAACAGCATTGGATGGCAGAAAAGAATCTGAAGCCCTTGT 856 

Qy 400 AAATGTTCTAGAATACTTGGCGTTGCAGTGCAGTCATTTTTTAAATAGAAAGGATATCAT 459 

III I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I 
Db 857 AAAGCTAATGGAAATATTTGGGACTCAATGTAGTTATTTGCTAAGCAGAAAGGATATAAT 916 

Qy 460 GGATTCCTTAAAGAATGAGAACTTCGACATGGTGATAGTTGAAACTTTTGACTACTGTCC 519 

I I I I I I I I I I I I I I I I I I I I I I I II I I.I I I llllll I I I I I I I I I I I I 
Db 917 GGATTCCTTAAAGAATGAGAACTATGATCTGGTATTTGTTGAAGCATTTGATTTCTGTTC 976 

Qy 520 TTTCCTGATTGCTGAGAAGCTTGGGAAGCCATTTGTGGCCATTCTTTCCACTTCATTCGG 579 

I I I I M I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 977 TTTCCTGATTGCTGAGAAGCTTGTGAAACCATTTGTGGCCATTCTTCCCACCACATTCGG 1036 

Qy 580 CTCTTTGGAATTTGGGCTACCAATCCCCTTGTCTTATGTTCCAGTATTCCGTTCCTTGCT 639 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I 
Db 1037 CTCTTTGGATTTTGGGCTACCAAGCCCCTTGTCTTATGTTCCAGTATTCCCTTCCTTGCT 1096 

Qy 640 GACTGATCACATGGACTTCTGGGGCCGAGTGAAGAATTTTCTGATGTTCTTTAGTTTCTG 699 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1097 GACTGATCACATGGACTTCTGGGGCCGAGTGAAGAATTTTCTGATGTTCTTTAGTTTCTC 1156 

Qy 700 CAGGAGGCAACAGCACATGCAGTCTACATTTGACAACACCATCAAGGAACATTTCACAGA 759 

llllll III I I I I I I I I I I I M I I I I I I I I I M I I I I I I I I I I I llllll I I I I 
Db 1157 CAGGAGCCAATGGGACATGCAGTCTACATTTGACAACACCATCAAGGAGCATTTCCCAGA 1216 

Qy 760 AGGCTCTAGGCCAGTTTTGTCTCATCTTCTACTGAAAGCAGAGTTGTGGTTCATTAACTC 819 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1217 AGGCTCTAGGCCAGTTTTGTCTCATCTTCTACTGAAAGCAGAGTTGTGGTTTGTTAACTC 1276 

.Qy 820 TGACTTTGCCTTTGATTTTGCTCGACCTCTGCTTCCCAACACTGTTTATGTTGGAGGCTT 879 

III I I I I I I I I I I I I I I I I I II II I I I I I I II I I I I I I I I I I M I I I II I I I I II 
Db 1277 TGATTTTGCCTTTGATTTTGCCCGGCCCCTGCTTCCCAACACTGTTTATATTGGAGGCTT 1336 

Qy 880 GATGGAAAAACCTATTAAACCAGTACCACAAGACTTGGAGAACTTCATTGCCAAGTTTGG 939 

I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1337 GATGGAAAAACCTATTAAACCAGTACCACAAGACTTGGACAACTTCATTGCCAACTTTGG 1396 

Qy 940 GGACTCTGGTTTTGTCCTTGTGACCTTGGGCTCCATGGTGAACACCTGTCAGAATCCGGA 999 

III I I I I I I I I I I I I I I I I I I I I M I I II I I I I I I I II I I I I I I I I I 
Db 1397 GGATGCAGGGTTTGTCCTTGTGGCCTTTGGCTCCATGTTGAACACCCATCAGTCCCAGGA 1456 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



1000 AATCTTCAAGGAGATGAACAATGCCTTTGCTCACCTACCCCAAGGGGTGATATGGAAGTG 1059 

^ I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 1 II I I I II II 
1457 AGTCCTCAAGAAGATGCACAATGCCTTTGCCCACCTCCCTCAAGGAGTGATATGGACATG 1516 

1060 TCAGTGTTCTCATTGGCCCAAAGATGTCCACCTGGCTGCAAATGTGAAAATTGTGGACTG 1119 

I II I I I I I I I I I II I I I I I II I I I I II MM I II I II II II II I M I II II II 
1517 TCAGAGTTCTCAT.TGGCCCAGAGATGTTCATTTGGCCACAAATGTGAAAATTGTGGACTG 1576 

1120 GCTTCCTCAGAGTGACCTCCTGGCTCACCCAAGCATCCGTCTGTTTGTCACCCACGGCGG 1179 

I I I II I II I II I I I I I I I I I I I I II I II I I I I I I I I I I II I I I I II I I I M II II 
1577 GCTTCCTCAGAGTGACCTCCTGGCTCACCCCAGCATCCGTCTTTTTGTCACTCATGGTGG 1636 

1180 GCAGAATAGCATAATGGAGGCCATCCAGCATGGTGTGCCCATGGTGGGGATCCCTCTCTT 1239 

II I II I III II I II I II I M II I I I II I II II II I I I I II II I I I II II 

1637 GCAGAACAGCGTAATGGAGGCCATCCGTCATGGTGTGCCCATGGTGGGATTACCAGTCAA 1696 

1240 TGGAGACCAGCCTGAAAACATGGTCCGAGTAGAAGCCAAAAAGTTTGGTGTTTCTATTCA 1299 

II I II I I II I I II II II I I I I II I II I I I I I I I I I I I I I I I II I I I I II I I I 

1697 TGGAGACCAGCATGGAAACATGGTCCGAGTAGTAGCCAAAAATTATGGTGTCTCTATCCG 1756 

1300 GTTAAAGAAGCTCAAGGCAGAGACATTGGCTCTTAAGATGAAACAAATCATGGAAGACAA 1359 

I I I II I I II I I I I I I I I II I I II I II I I II I II II I I II I I II II 
1757 GTTGAATCAGGTCACAGCCGACACACTGACACTTACAATGAAACAAGTCATAGAAGACAA 1816 

1360 GAGATACAAGTCCGCGGCAGTGGCTGCCAGTGTCATCCTGCGCTCCCACCCGCTCAGCCC 1419 

III I I I II I I I II I I II II I I I I I II I I I I II I I I III II II II I II I I 
1817 GAGGTACAAGTCGGCAGTGGTGGCAGCCAGTGTCATCCTGCACTCTCAGCCCCTGAGCCC 1876 

1420 CACACAGCGGCTGGTGGGCTGGATTGACCACGTCCTCCAGACAGGGGGCGCGACGCACCT 1479 

I I II I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
1877 CGCACAGCGGCTGGTGGGCTGGATCGACCACATCCTCCAGACTGGGGGAGCGACGCACCT 1936 

1480 CAAGCCCTATGTCTTTCAGCAGCCCTGGCATGAGCAGTACCTGTTCGACGTTTTTGTGTT 1539 

I II I II II II I III I I II II I I II I I II II I II I I II I I I II II I I II I I I L 
1937 CAAGCCCTATGCCTTCCAGCAGCCTTGGCATGAGCAGTACCTCATTGATGTCTTTGTGTT 1996 

154 0 TCTGCTGGGGCTCACTCTGGGGACTCTATGGCTTTGTGGGAAGCTGCTGGGCATGGCTGT 1599 

II I I I I I I I I I I II I I I II I I III I II I I I I I I II I I I I I I I I I I I II III I 

1997 TCTGCTGGGGCTCACTCTGGGCACTATGTGGCTTTGTGGGAAGCTGCTGGGTGTGGTGGC 2056 

1600 CTGGTGGCTGCGTGGGGCCAGAAAGGTGAAGGAGACATAAGGCCAGGTGCAGCCTTGGCG 1659 

I II II I I I I II II II I I I I I I I I I II II I II I I II I I I I II I I I I I I II II I 
2057 CAGGTGGCTGCGTGGGGCCAGGAAGGTGAAGAAGACATGAGGCTAGGTGTAGCCTTGGGT 2116 

1660 GGG 1662 
II 

2117 GAG 2119 
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SUMMARIES 

No. Score Match Length DB ID Description 
RESULT 1 

ID AAC58107 standard; cDNA; 2320 BP. 

DE Human PRO1780 nucleotide sequence SEQ ID NO: 12. 

PN WO200053750-A1. 

PD 14-SEP-2000. 

PA (GETH ) GENENTECH INC. 

Query Match 100.0%; Score 2320; DB 3; Length 2320; 

Best Local Similarity 100.0%; Pred. No. 0; 
RESULT 2 

ID AAA37101 standard; cDNA; 2320 BP. 

DE Human PRO1780 (UNQ842) cDNA sequence SEQ ID NO:281. 

PN WO200012708-A2. 

PD 09-MAR-2000. 

PA (GETH ) GENENTECH INC. 



Score 2320; 
Pred. No. 0; 



Query Match 100.0%; 
Best Local Similarity 100.0%; 
RESULT 3 

ID AAF54401 standard; DNA; 2320 BP. 

DE Primer #82 used in the identification of proteins. 

PN WO200078961-A1. 

PD 28-DEC-2000. 

PA (GETH ) GENENTECH INC. 



DB 3; Length 2320; 



Query Match 100.0%; Score 2320; DB 4; Length 2320; 

Best Local Similarity 100.^0%; Pred» No. 0; 
RESULT 4 

ID ACD68440 standard; cDNA; 2320 BP. 

DE Novel human secreted and transmembrane protein PRO1780 cDNA. 
PN US2003073130-A1. 
PD 17-APR-2003. 

PA (GETH ) GENENTECH INC. ^ 

Query Match 100.0%; Score 2320; DB 9; Length 2320; 

Best Local Similarity 100.0%; Pred. No. 0; 
RESULT 5 . ■ 

ID ACH04542 standard; cDNA; 2320 BP. 

DE Human cDNA encoding secreted/transmembrane protein PRO1780. 

PN US2003044841-A1. 

PD 06-MAR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 100.0%; Score 2320; DB 9; Length 2320; 

Best Local Similarity 100.0%; Pred. No. 0; 
RESULT 6 

ID ACD68086 standard; cDNA; 2320 BP. 

Novel human secreted and transmembrane protein PRO1780 cDNA. 
US2003073129~A1. 



DE 
PN 
PD 
PA 



17-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match 100.0%; 
Best Local Similarity 100.0%; 
RESULT 7 

ID ADC18150 standard; cDNA; 2320 BP 

DE Human PRO polynucleotide #80. 

PN US2003064925-A1. 

PD 03-APR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 100.0%; 

Best Local Similarity 100.0%; 
RESULT 8 

ID ADD70796 standard; cDNA; 



Score 2320; DB 9; Length 2320; 
Pred. No. 0; 



Score 2320; 
Pred. No. 0; 



DB 10; Length 2320; 



Length 2320; 



2320 BP. 

Human cDNA encoding secreted/transmembrane protein PRO1780. 
US2003099625-A1. 
29-MAY-2003. 
(GETH ) GENENTECH INC. 
Query Match 100.0%; Score 2320; DB 10; 

Best Local Similarity 100.0%; Pred. No. 0; 
RESULT 9 

ID ADD39873 standard; cDNA; 2320 BP. 

Human cDNA encoding secreted/transmembrane protein PRO1780. 

.yS2003083462-Al. 

Ol-MAY-2003. 
(GETH ) GENENTECH INC. 
Query Match 100.0%; Score 2320; DB 10; 

Best Local Similarity 100.0%; Pred. No. 0; 
RESULT 10 

ID ADD70319 standard; cDNA; 2320 BP. 

Human cDNA encoding secreted/transmembrane protein PRO1780. 
US2003054406-A1. 
20-MAR-2003. 
(GETH ) GENENTECH INC. 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



Length 2320; 



Query Match 



100.0%; Score 2320;. DB 10; Length 2320; 



Best Local Similarity 100-0%; Pred. No. 0; 
RESULT 11 

ID ADD38440 standard; cDNA; 2320 BP. 

DE Human cDNA encoding secreted/ transmembrane protein PRO1780. 

PN US2003096955-A1 . 

PD 22-MAY-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 100.0%; Score 2320; DB 10; Length 2320; 

Best Local Similarity 100.0%; Pred. No. 0; 
RESULT 12 

ID ADD39396 standard; cDNA; 2320 BP. 

DE Human cDNA encoding secreted/transmembrane protein PRO1780. 

PN US2003096954-A1. 

PD 22-MAY-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 100.0%; Score 2320; DB 10; Length 2320; 

Best Local Similarity 100.0%; Pred. No. 0; 
RESULT 13 

ID ADD38919 standard; cDNA; 2320 BP. 

DE Human cDNA encoding secreted/transmembrane protein PRO1780. 

PN US2003092061-A1. 

PD 15-MAY-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 100.0%; Score 2320; DB 10; Length 2320; 

Best Local Similarity 100.0%; Pred. No. 0; 
RESULT 14 

ID ADD40350 standard; cDNA; 2320 BP. 

DE Human cDNA encoding secreted/transmembrane protein PRO1780. 

PN US2003082627-A1. 

PD Ol-MAY-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 100.0%; Score 2320; DB 10; Length 2320; 

Best Local Similarity 100.0%; Pred. No. 0; 
RESULT 15 

ID ADE50571 standard; cDNA; 2320 BP. 

DE Human cDNA encoding secreted/transmembrane protein PRO1780. 

PN US2003069179~A1. 

PD lO-APR-2003. . 

PA (GETH ) GENENTECH INC. 

Query Match 100.0%; Score 2320; DB 10; Length 2320; 

Best Local Similarity 100.0%; Pred. No. 0; 
RESULT 16 

ID ADE20183 standard; cDNA; 2320 BP. 

DE Human cDNA encoding secreted/transmembrane protein PRO1780. 

PN US2003092883-A1. 

PD 15 -MAY- 2 0.0 3. 

PA (GETH ) GENENTECH INC. 

Query Match 100.0%; Score 2320; DB 10; Length 2320; 

Best Local Similarity 100.0%; Pred. No. 0; 
RESULT 17 

ID ADE50094 standard; cDNA; 2320 BP. 

DE Human cDNA encoding secreted/transmembrane protein PRO1780. 

PN US2003082626-A1. 

PD Ol-MAY-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 100.0%; Score 2320; DB 10; Length 2320; 

Best Local Similarity 100.0%; Pred. No. 0; 



RESULT 18 

ID ADE21652 standard; cDNA; 2320 BP. 

DE Human cDNA encoding secreted/ transmembrane protein PRO1780. 

PN US2003082628-A1. 

PD Ol-MAY-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 100.0%; Score 2320; DB 10; Length 2320; 

Best Local Similarity 100.0%; Pred. No. 0; 
RESULT 19 

ID ADF30077 standard; cDNA; 2320 BP. 

DE Human cDNA encoding secreted/transmembrane protein PRO1780. 
PN US2003204053-A1. 

PD 30-OCT-2003. ^ 
PA (GETH ) GENENTECH INC. 

Query Match 100.0%; Score 2320; DB 10; Length 2320; 

Best Local Similarity 100.0%; Pred. No. 0; 
RESULT 20 

ID ADF55970 standard; cDNA; 2320 BP. 

DE Hioman cDNA encoding secreted/transmembrane protein PRO1780. 

PN US2003204054-A1. 

PD 30-OCT-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 100.0%; Score 2320; DB 10; Length 2320; 

Best Local Similarity 100.0%; Pred. No. 0; 
RESULT 21 

ID ADH99474 standard; cDNA; 2320 BP. 

DE Human cDNA encoding secreted/transmembrane protein PRO1780. 

PN US2003065142-A1. 

PD 03-APR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 100.0%; Score 2320; DB 10; Length 2320; 

Best Local Similarity 100.0%; Pred. No. 0; 
RESULT 22 

ID ADE96654 standard; cDNA; 2320 BP. 

DE Human cDNA encoding secreted/transmembrane protein PRO1780. 

PN US2003195347-A1. 

PD 16-OCT-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 100.0%; Score 2320; DB 12; Length 2320; 

Best Local Similarity 100.0%; Pred. No. 0; 
RESULT 23 

ID ADF25965 standard; cDNA; 2320 BP. 

DE Human cDNA encoding secreted/transmembrane protein PRO1780. 

PN US2003199675-A1. 

PD 23-OCT-2003. 

PA (GETH ) GENENTECH. JNC. 

Query Match 100.0%; Score 2320; DB 12; Length 2320; 

Best Local Similarity 100.0%; Pred. No. 0; 
RESULT 24 

ID ADF24864 standard; cDNA; 2320 BP. 

DE Human cDNA encoding secreted/transmembrane protein PRO1780. 

PN US2003198993-A1. 

PD 23-OCT-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 100.0%; Score 2320; DB 12; Length 2320; 

Best Local Similarity 100.0%; Pred. No. 0; 
RESULT 25 



ID ADF29600 standard; cDNA; 2320 BP. 

DE Human cDNA encoding secreted/transmerabrane protein PRO1780. 

FN US2003203401-A1. 

PD 30-OCT-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 100.0%; Score 2320; DB 12; Length 2320; 

Best Local Similarity 100.0%; Pred. No. 0; 
RESULT 26 

ID ADE97131 standard; cDNA; 2320 BP. 

DE Human cDNA encoding secreted/transmembrane protein PRO1780, 

PN US2003195334-A1. 

PD 16-OCT-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 100.0%; Score 2320; DB 12; Length 2320; 

Best Local Similarity 100.0%; Pred. No. 0; 
RESULT 27 

ID ADH03169 standard; cDNA; 2320 BP. 

DE Human cDNA encoding secreted/transmembrane protein PRO1780. 

PN US2003216562-A1. 

PD 20-NOV-2003. 

PA (GETH .) GENENTECH INC. 

Query Match 100.0%; Score 2320; DB 12; Length 2320; 

Best Local Similarity 100.0%; Pred. No. 0; 
RESULT 28 ' 

ID ADH04123 standard; cDNA; 2320 BP. 

DE Human cDNA encoding secreted/transmembrane protein PRO1780. 

PN US2003220471-A1 . 

PD 27-NOV-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 100.0%; Score 2320; DB 12; Length 2320; 

Best Local Similarity 100.0%; Pred. No. 0; 
RESULT 29 

ID ADH0364 6 standard; cDNA; 2320 BP. 

DE Human cDNA encoding secreted/transmembrane protein PRO1780. 

PN US2003224478-A1. 

PD 04-DEC-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 100.0%; Score 2320; DB 12; Length 2320; 

Best Local Similarity 100.0%; Pred. No. 0; 
RESULT 30 

ID ADH04600 standard; cDNA; 2320 BP. 

DE Human cDNA encoding secreted/transmembrane protein PRO1780. 

PN US2004005626-A1. 

PD 08-JAN-2004. 

PA (GETH ) GENENTECH INC. 

Query Match 100.0%.;.,,.. Score 2320; DB 12; Length 2320; 

Best Local Similarity 100.0%; Pred. No. 0; 
RESULT 31 

ID ADH61601 standard; cDNA; 2320 BP. 

DE Human cDNA encoding secreted/transmembrane protein PRO1780. 

PN US2004014130-A1. 

PD 22-JAN-2004. 

PA (GETH ) GENENTECH INC. ' 

Query Match 100.0%; Score 2320; DB 12; Length 2320; 

Best Local Similarity 100.0%; Pred. No. 0; 
RESULT 32 

ID ADL94800 standard; cDNA; 2320 BP. 



DE Human cDNA encoding secreted/ transmembrane protein PRO1780. 

PN US2004073015-A1. 

PD 15-APR-2004. 

PA (GETH ) GENENTECH INC. 

Query Match 100.0%; Score 2320; DB 12; Length 2320; 

Best Local Similarity 100.0%; Pred. No. 0; 
RESULT 33 

ID AAF93775 standard; cDNA; 2341 BP. 

DE Human cDNA encoding a membrane or secretory protein clone PSEC0073. 

PN EP1067182-A2. 

PD lO-JAN-2001. 

PA (HELI-) HELIX RES INST. 

Query Match 99.6%; Score 2310.2; DB 5; Length 2341; 

Best Local Similarity 99.9%; Pred. No. 0; 
RESULT 34 

ID AAL41485 standard; DNA; 2944 BP. 

DE Drug metabolising enzyme encoding DNA - 7486594CB1. 
PN WO200266654-A2. 
PD 29-AUG-2002. 

PA (INCY-) INCYTE GENOMICS INC. 

Query Match 90.6%; Score 2102.8; DB 6; Length 2944; 

Best Local Similarity 95.5%; Pred. No. 0; 
RESULT 35 

ID AAS62475 standard; cDNA; 2074 BP. 

DE cDNA sequence #262 encoding novel human secreted protein. 

PN WO200177291-A2. 

PD 18-OCT-2001. 

PA (GEMY ) GENETICS INST INC. 

Query Match 88.9%; Score 2063.4; DB 6; Length 2074; 

Best Local Similarity 99.7%; Pred. No. 0; 
RESULT 36 

ID ADR19692 standard; DNA; 2263 BP. 

DE Human drug metabolising enzyme (DME)-13 gene sequence. 
PN WO200226988-A2. 
PD 04-APR-2002. 

PA (INCY-) INCYTE GENOMICS INC. 

Query Match 68.1%; Score 1578.8; DB 7; Length 2263; 

Best Local Similarity 92.6%; Pred. No. 0; 
RESULT 37 

ID AAD06821 standard; cDNA; 2797 BP. 

DE Human secreted protein Zalpha37 cDNA. 

PN WO200138505-A2. 

PD 31-MAY-2001. 

PA (ZYMO ) ZYMOGENETICS INC. 

Query Match 55.2%; Score 1281.6; DB 4; Length 2797; 

Best Local Similarity 85.9%; Pred. No,.^ 0; 
RESULT 38 

ID ADA53720 standard; cDNA; 2823 BP. 

DE Human coding sequence, SEQ ID 1288. 

PN EP1293569-A2. 

PD 19-MAR-2003. 

PA (HELI-) HELIX RES INST. 

PA (REAS-) RES ASSOC BIOTECHNOLOGY. 

Query Match 55.2%; Score 1281.6; DB 10; Length 2823; 

Best Local Similarity 85.9%; Pred. No. 0; 
RESULT 39 

ID AAD24667 standard; cDNA; 2086 BP. 



DE Human drug metabolising enzyme (DME)-2 cDNA. 
PN WO200L79468-A2. 
PD 25-OCT-2001, 

PA (INCY-) INCYTE GENOMICS INC. 

Query Match 55.2%; Score 1280; 

Best Local Similarity 85.8%; Pred. No. 0; 
RESULT 40 

ID ABK90831 standard; cDNA; 1851 BP. 

cDNA encoding drug metabolising enzyme. 
US2002082194-A1. 
27-JUN-2002. 
(GUEG/) GUEGLER K. 
(WEBS/) WEBSTER M. 
YAN C. 

DI FRANCESCO V. 
BEASLEY E M. 



DB 6; Length 2086; 



DE 
PN 
PD 
PA 
PA 
PA 
PA 
PA 



(YANC/) 
(DFRA/) 
(BEAS/) 
Query Match 
Best Local Similarity 
RESULT 41 

ID ADN02571 standard; DNA; 



54.8%; Score 1270.4. 
85.9%; Pred. No. 0; 



DB 6; Length 1851; 



DB 11; Length 2868; 



Score 1257.6; 
Pred. No. 0; 



DB 6; Length 2082; 



2868 BP. 

Human hepatouracil dinucleotide glycosyltransf erase 29.7 DNA. 
CN1393551-A. 
29-JAN-2003. 

(BIOW-) BIOWINDOW GENE DEV INC SHANGHAI. 
Query Match 54.5%; Score 1264.8; 

Best Local Similarity 85.6%; Pred. No. 0; 
RESULT 42 

ID AAD25345 standard; cDNA; 2082 BP. 

Human UDP-glycosyltransf erase, 32626 cDNA. 
WO200202774-A2. 
lO-JAN-2002. 

(MILL-) MILLENNIUM PHARM INC. 
Query Match 54.2%; 
Best Local Similarity 85.0%; 
RESULT 43 

ID ABL57726 standard; DNA; 1572 BP. 
Human sbgl0054OUDPGT gene #2. 
WO200222802-A1. 
21-MAR-2002. 

(SMIK ) SMITHKLINE BEECHAM CORP. 
(SMIK ) SMITHKLINE BEECHAM PLC. 
(GLAX ) GLAXO GROUP LTD. 
Query Match 52.2%; 
Best Local Similarity 85.7%; 
RESULT 44 

ID ADA21188 standard; cDNA; 2268 BP. 

Human secreted protein SECP-42 encoding cDNA SEQ ID NO: 93. 
WO2003068943-A2. 
21-AUG-2003. 

(INCY-) INCYTE GENOMICS INC. 
Query Match 49.5%; Score 1147.8; 

Best Local Similarity 85.2%; Pred. No. 0; 
RESULT 45 

ID ABX70800 standard; cDNA; 2426 BP. 
DE Novel human cDNA sequence #25. 
PN WO200281731-A2. 
PD 17-OCT-2002. 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 
PA 
PA 



Score 1212; 
Pred. No. 0; 



DB 6; Length 1572; 



DE 
PN 
PD 
PA 



DB 9; Length 2268; 



49.5%; Score 1147.8; 
85.2%; Pred. No. 0; 



DB 8; Length 2426; 



PA (HYSE-) HYSEQ INC. 
PA (GOOD/) GOODRICH. R W 
Query Match 
Best Local Similarity 
RESULT 46 

ID ADQ75567 standard; cDNA; 2791 BP. 

DE Uridine diphosphate-galactosyl ceramide glucuronyl collagen transferase. 
PN CN1380409-A. 
PD 20-NOV-2002. 

PA (SHAN-) SHANGHAI BIOWINDOW GENE DEV INC. 



Query Match 49.5%; 
Best Local Similarity 85.2%; 
RESULT 47 

ID ABL57725 standard; DNA; 1182 BP 
DE Human sbgl0054OUDPGT gene #1. 
PN WO200222802-A1. 
PD 21-MAR-2002. 

PA (SMIK ) SMITHKLINE BEECHAM CORP 
PA (SMIK ) SMITHKLINE BEECHAM PLC. 
PA (GLAX ) GLAXO GROUP LTD. 



Score 1147.8; 
Pred. No. 0; 



DB 11; Length 2791; 



Score 956.4; DB 6; Length 1182; 
Pred. No. 4.4e-274; 



Query Match 41.2%; 
Best Local Similarity 88.1%; 
RESULT 48 

ID AAD06822 standard; DNA; 1569 BP. 

DE Human secreted protein Zalpha37 degenerate coding sequence. 

PN WO200138505-A2. 

PD 31-MAY-2001. 

PA (ZYMO ) ZYMOGENETICS INC. 



Score 944.8; DB 4; Length 1569; 
Pred. No. 1.5e-270; 



Query Match 40.7%; 
Best Local Similarity 52.4%; 
RESULT 49 

ID AAD06823 standard; cDNA; 2212 BP. 
DE Mouse secreted protein Zalpha37 cDNA. 
PN WO200138505-A2. 
PD 31-MAY-2001. 
PA (ZYMO ) ZYMOGENETICS INC. 

Query Match 39.3%; 

Best Local Similarity 73.0%; 
RESULT 50 

ID AAS84832 standard; cDNA; 2721 BP. 

DNA encoding novel human diagnostic protein #20636 
WO200175067-A2. 



Score 910.8; DB 4; 
Pred. No. 2.8e-260; 



Length 2212; 



DE 
PN 
PD 
PA 



34.9%; Score 808.8; DB 5; 
95.8%; Pred. No. 9.2e-230; 



Length 2721; 



ll-OCT-2001. 
(HYSE-) HYSEQ INC. 
Query Match 
Best Local. Similarity 
RESULT 51 

ID AAD06824 standard; DNA; 1569 BP. 

DE Mouse secreted protein Zalpha37 degenerate coding sequence 

PN WO200138505-A2. 

PD 31-MAY-2001. 

PA (ZYMO ) ZYMOGENETICS INC. 

Query Match 34.0%; Score 789; 

Best Local Similarity 47.8%; Pred. No. 5 
RESULT 52 

ID ABA09537 standard; cDNA; 1898 BP. 

DE Human PRO1780 homologue- encoding cDNA, SEQ ID NO: 1313 



DB 4; Length 1569; 
2e-224; 



Score 708.2; DB 4; 
Pred. No. 8e-200; 



Length 1898; 



PN WO200157188-A2. 
PD 09-AUG-2001. 
PA (HYSE-) HYSEQ INC. 

Query Match 30.5%; 

Best Local Similarity 86.4%; 
RESULT 53 

ID AAF93971 standard; DNA; 770 BP. 

Primer specific for DNA encoding secretory /membrane protein SEQ ID 405. 
EP1067182-A2. 
lO-JAN-2001. 
(HELI-) HELIX RES INST. 
Query Match 25.7%; Score 595.2; DB 5; Length 770; 

Best Local Similarity 93.4%; Pred. No. 2.4e-166; 
RESULT 54 

ID ADB62657 standard; cDNA; 1842 BP. 

Human cDNA encoding clone KIDNE20186170 . 
EP1308459-A2. 
07-MAY-2003. 
(HELI-) HELIX RES INST. 
(REAS-) RES ASSOC BIOTECHNOLOGY. 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 

PA 



Score 593.4; DB 10; Length 1842; 
Pred. No. 1.5e-165; 



Score 575.4; DB 12; 
Pred. No. 6.9e-160; 



Length 5002; 



Query Match 25.6%; 
Best Local Similarity 85.6%; 
RESULT 55 

ID ADQ64368 standard; cDNA; 5002 BP. 
Novel human cDNA sequence #1529. 
EP1440981-A2. 
28-JUL-2004. 

(REAS-) RES ASSOC BIOTECHNOLOGY. 
Query Match 24.8%; 
Best Local Similarity 85.3%; 
RESULT 56 

ID AAF94120 standard; DNA; 594 BP. 

Primer specific for DNA encoding secretory/membrane protein SEQ ID 554, 
EP1067182-A2. 
lO-JAN-2001. 
(HELI-) HELIX RES INST. 
Query Match 23.2%; Score 538.4; DB 5; Length 594; 

Best Local Similarity 98.5%; Pred- No. 1.9e-149; 
RESULT 57 

ID AAS81804 standard; cDNA; 2220 BP. 

DNA encoding novel human diagnostic protein #17608. 



DE 
PN 
PD 

PA 



DE 
PN 
PD 

PA 



DE 
PN 
PD 
PA 



21.0%; Score 487.6; DB 5; 
93.7%; Pred. No. 6.7e-134; 



Length 2220; 



WO200175067-A2. 
ll-OCT-2001. 
(HYSE-) HYSEQ INC, 
Query Match 
Best Local Similarity 
RESULT 58 

ID ADN39359 standard; cDNA; 923 BP. 

DE Cancer/angiogenesis/f ibrosis-related nucleic acid, SEQ ID NO:B43. 
PN WO2003042661-A2 . 
PD 22-MAY-2003. 

PA (EOSB-) EOS BIOTECHNOLOGY INC. 

Query Match 19.0%; Score 441; DB 11; Length 923; 

Best Local Similarity 100,0%; Pred. No. 3,le-120; 

RESULT 59 

ID ABK90832 standard; DNA; 42999 BP. 

DE Genomic DNA encoding drug metabolising enzyme. 



PN US2002082194-A1. 

PD 27-JUN~2002. 

PA (GUEG/) GUEGLER K. 

PA (WEBS/) WEBSTER M. 

PA (YANC/) YAN C. 

PA (DFRA/) DI FRANCESCO V. 

PA (BEAS/) BEASLEY E M. 

Query Match 18.7%; Score 433.6; DB 6; Length 42999; 

Best Local Similarity 88.1%; Pred. No. 6.4e-117; 
RESULT 60 

ID AAS81803 standard; cDNA; 659 BP. 

DE DNA encoding novel human diagnostic protein #17607. 

PN WO200175067-A2. 

PD ll-OCT-2001. 

PA (HYSE-) HYSEQ INC. 

Query Match 17.2%; Score 399.6; DB 5; Length 659; 

Best Local Similarity 92.5%; Pred. No. 5.7e-108; 
RESULT 61 

ID AAS41046 standard; cDNA; 923 BP. 

DE cDNA encoding novel human enzyme polypeptide #262. 
PN WO200155301-A2. 
PD 02-AUG-2001. 

PA (HUMA-) HUMAN GENOME SCI INC. 

Query Match 13.8%; Score 319.6; DB 4; Length 923; 

Best Local Similarity 84.8%; Pred. No. 5.5e-84; 
RESULT 62 

ID AAS41586 standard; cDNA; 981 BP. 

DE cDNA encoding novel hxaman enzyme polypeptide #802. 
PN WO200155301-A2. 
PD 02-AUG-2001. 

PA (HUMA-) HUMAN GENOME SCI INC. 

Query Match 13.8%; Score 319.4; DB 4; Length 981; 

Best Local Similarity 85.4%; Pred. No. 6.6e-84; 
RESULT 63 

ID AAL01667 standard; cDNA; 981 BP. 

DE Human reproductive system related antigen cDNA SEQ ID NO: 1668. 
PN WQ20015532Q-A2. 
PD 02-AUG-2001. 

PA (HUMA-) HUMAN GENOME SCI INC. 

Query Match 13.8%; Score 319.4; DB 4; Length 981; 

Best Local Similarity 85.4%; Pred. No. 6.6e-84; 
RESULT 64 

ID ADQ57767 standard; DNA; 582 BP. 

DE Novel canine microarray-related DNA sequence SeqID9069. 

PN WO2004063324-A2. 

PD 29-JUL-2004. 

PA (GENE-) GENE LOGIC INC. 

PA ■ (PFIZ ) PFIZER PROD INC. 

Query Match 13.6%; Score 314.8; DB 13; Length 582; 

Best Local Similarity 81.8%; Pred. No. l.le-82; 
RESULT 65 

ID AAL05664 standard; DNA; 5973 BP. 

DE Human reproductive system related antigen DNA SEQ ID NO: 8352. 
PN WO200155320-A2. 
PD 02-AUG-2001. 

PA (HUMA-) HUMAN GENOME SCI INC. 

Query Match 10.3%; Score 239.2; DB 4; Length 5973; 



Best Local Similarity 86.0%; Bred. No. 2e-59; 
RESULT 66 

ID AAK11854 standard; DNA; 507 BP. 

DE Human brain expressed single exon probe SEQ ID NO: 11845. 
PN WO200157275-A2. 
PD 09-AUG-2001. 

PA (MOLE-) MOLECULAR DYNAMICS INC. 

Query Match 10.3%; Score 238.8; DB 4; Length 507; 

Best Local Similarity 86.3%; Pred. No. 5e-60; 
RESULT 67 

ID AAL05663 standard; DNA; 2751 BP. 

DE Human reproductive system related antigen DNA SEQ ID NO: 8351. 
PN WO200155320-A2. 
PD 02-AUG-2001. 

PA (HUMA-) HUMAN GENOME SCI INC. 

Query Match 10.3%; Score 238.8; DB 4; Length 2751; 

Best Local Similarity 86.3%; Pred. No. 1.5e-59; 
RESULT 68 

ID AAL05665 standard; DNA; 5974 BP. 

DE Human reproductive system related antigen DNA SEQ ID NO: 8353. 
PN WO200155320-A2. 
PD 02-AUG-2001. 

PA (HUMA-) HUMAN GENOME SCI INC. 

Query Match 10.3%; Score 238.8; DB 4; Length 5974; 

Best Local Similarity 86.3%; Pred. No. 2.6e-59; 
RESULT 69 

ID AAS84831 standard; cDNA; 1242 BP. 

DE DNA encoding novel human diagnostic protein #20635, 

PN WO200175067-A2. 

PD ll-OCT-2001. 

PA (HYSE-) HYSEQ INC. 

Query Match 9.7%; Score 224; DB 5; Length 1242; 

Best Local Similarity 100.0%; Pred. No. 2.4e-55; 
RESULT 70 

ID AAS82074 standard; cDNA; 1242 BP. 

DE DNA encoding novel human diagnostic protein #17878. 

PN WO200175067-A2. 

PD ll-OCT-2001. 

PA (HYSE-) HYSEQ INC. 

Query Match 9.7%; Score 224; DB 5; Length 1242; 

Best Local Similarity 100.0%; Pred. No. 2.4e-55; 
RESULT 71 

ID AAK24438 standard; DNA; 277 BP. 

DE Human brain expressed single exon probe SEQ ID NO: 24429. 
PN WO200157275-A2. 

PD 09-AUG-2001. _ 
PA (MOLE-) MOLECULAR DYNAMICS INC. 

Query Match 9.5%; Score 220; DB 4; Length 277; 

Best Local Similarity 87.3%; Pred. No. 1.4e-54; 
RESULT 72 

ID ABI99695 standard; cDNA; 2308 BP. 

DE Mouse ischaemic condition related cDNA sequence SEQ ID NO: 738. 
PN WO200188188-A2. 
PD 22-NOV-2001. 

PA (UYNI-) UNIV NIHON SCHOOL JURIDICAL PERSON. 

Query Match 7.1%; Score 164; DB 6; Length 2308; 

Best Local Similarity 47.5%; Pred. No. 3e-37; 



RESULT 73 

ID ACN45084 standard; DNA; 49753 BP. 
DE Mouse genomic sequence mCG7831. 
PN WO2003073826-A2 . 
PD 12-SEP-2003. 
PA (SAGR~) SAGRES DISCOVERY. 

Query Match 7.0%; Score 162.6; DB 11; Length 49753; 

Best Local Similarity 72.7%; Pred. No. 6e-36; 
RESULT 74 

ID ADB59021 standard; DNA; 1716 BP. 
DE Toxi city-related gene, SEQ ID 4047. 
PN WO2003064624-A2. 
PD O7-AUG72OO3. 
PA (GENE-) GENE LOGIC INC. 

Query Match 6.4%; Score 148; DB 10; Length 1716; 

Best Local Similarity 51.9%; Pred. No. 1.5e-32; 
RESULT 75 

ID ADB53779 standard; DNA; 1716 BP. 

DE Primary rat hepatocyte toxicity modelling related gene SEQ ID NO: 4321. 
PN WO2003065993-A2. 
PD 14-AUG-2003. 
PA (GENE-) GENE LOGIC INC. 

Query Match 6.4%; Score 148; DB 10; 

Best Local Similarity 51.9%; Pred. No. 1.5e-32; 
RESULT 76 

ID ADP73i308 standard; DNA; 1716 BP. 
DE Renal toxin progression gene marker #1597. 
PN WO2004 048598-A2. 
PD lO-JUN-2004. 
PA (GENE-) GENE LOGIC INC. 

Query Match 6.4%; Score 148; DB 12; 

Best Local Similarity 51.9%; Pred. No. 1.5e-32; 
RESULT 77 

ID ADP72663 standard; DNA; 1819 BP. 
DE Renal toxin progression gene marker #1252. 
PN WO2004048598-A2. 
PD lO-JUN-2004. 
PA (GENE-) GENE LOGIC INC. 

Query Match 6.4%; Score 148; DB 12; 

Best Local Similarity 51.9%; Pred. No. 1.5e-32; 
RESULT 78 

ID ABK63496 standard; cDNA; 1961 BP. 
DE Rat sequence differentially expressed in response to a hepatotoxin #1403. 
PN WO200210453-A2. 
PD 07-FEB-2002. 
PA (GENE-) GENE LOGIC INC. 

Query Match 6.3%; Score 145.2; DB 6; Length 1961; 

Best Local Similarity 49.4%; Pred. No. l.le-31; 
RESULT 79 

ID ADB57963 standard; DNA; 1961 BP. 

DE Toxicity-related gene, SEQ ID 2989. 

PN WO2003064624-A2. 

PD 07-AUG-2003. 

PA (GENE-) GENE LOGIC INC. 

Query Match 6.3%; Score 145.2; DB 10; Length 1961; 

Best Local Similarity 49.4%; Pred. No. l.le-31; 
RESULT 80 



Length 1716; 



Length 1716; 



Length 1819; 



ID 
DE 
PN 
PD 
PA 



ABZ58827 standard; cDNA; 1584 BP. 

Human 32624 polypeptide coding sequence. 

WO200226834-A2. 

04-APR-2002. 

(MILL-) MILLENNIUM PHARM INC. 



6.2%; 
48.4%; 



Score 144.2; DB 6; Length 1584; 



Pred. No. 1.9e-31; 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



Score 144.2; DB 6; 
Pred. No. 2.9e-31; 



Length 2996; 



6.2%; 
48.3%; 



Score 143.2; DB 6; Length 1636; 
Pred. No. 3.9e-31; 



Query Match 
Best Local Similarity 
RESULT 81 

ID ABZ58826 standard; cDNA; 2996 BP. 

Human 32624 polypeptide encoding cDNA. 
WO200226834-A2. 
04-APR-2002. 

(MILL-) MILLENNIUM PHARM INC. 
Query Match 6.2%; 
Best Local Similarity 48.4%; 
RESULT 82 

ID AAD24666 standard; cDNA; 1636 BP. 

Human drug metabolising enzyme (DME)-l cDNA. 
WO200179468-A2. 
25-OCT-2001. 

(INCY-) INCYTE GENOMICS INC. 
Query Match 
Best Local Similarity 
RESULT 83 

ID ADA11047 standard; cDNA; 1636 BP. 

Human cDNA differentially expressed in colon cancer #117. 
US2002160382-A1. 
31-OCT-2002. 
(LASE/) LASEK A W. 
(JONE/) JONES D A. 
Query Match 6.2%; 
Best Local Similarity 48.3%; 
RESULT 84 

ID ADS09871 standard; DNA; 2408 BP. 

Human therapeutic DNA - SEQ ID 108. 
WO2004080148-A2. 
23-SEP-2004. 
(NUVE-) NUVELO INC. 
Query Match 6.2%; Score 143.2; DB 13; Length 2408; 

Best Local Similarity 48.3%; Pred. No. 5e-31; 
RESULT 85 

ID ABK12422 standard; cDNA; 2759 BP. 

cDNA encoding human drug-metabolising enzyme. 
WO200218554-A2. 
07-MAR-2002. 
(APPLr:). APPLERA CORP. 



DE 
PN 
PD 
PA 
PA 



DE 
PN 
PD 
PA 



Score 143.2; DB 9; 
Pred. No. 3.9e-31; 



Length 1636; 



DE 

PN 
PD 
PA 



Score 143.2; DB 6; Length 2759; 



Pred. No. 5.5e-31; 



Query Match 6.2%; 
Best Local Similarity 48.3%; 
RESULT 86 

ID ADA10915 standard; cDNA; 2966 BP. 

Human cDNA differentially expressed in colon cancer #22. 
US2002160382-A1. 



DE 
PN 
PD 
PA 
PA 



31-OCT-2002. 
(LASE/) LASEK A W. 
(JONE/) JONES D A. 
Query Match 
Best Local Similarity 



6.2%; Score 143.2; DB 9; Length 2966; 
48.3%; Pred. No. 5.7e-31; 



RESULT 87 

ID ADL12512 standard; cDNA; 2966 BP. 

DE Human steroid-induced C3A liver cell cDNA #241. 

PN US6673549-B1. 

PD 06-JAN-2004. 

PA (INCY-) INCYTE CORP. 



6.2%; Score 143.2; DB 12; 
48.3%; Pred. No. 5.7e-31; 



Length 2966; 



Score 143.2; DB 4; 
Pred. No. 5.7e-31; 



Length 2974; 



Query Match 
Best Local Similarity 
RESULT 88 

ID AAS46185 standard; cDNA; 2974 BP. 

DE Human DNA encoding PRO polypeptide sequence #261. 
PN WO200168848-A2. 
PD 20-SEP-2001. ■ 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 89 

ID ACA89635 standard; cDNA; 2974 BP. 
DE cDNA encoding human PRO polypeptide #261, 
PN US2003036141-A1. 
PD 20-FEB-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 90 

ID ACA73645 standard; cDNA; 2974 BP. 

DE Human secreted/ transmembrane protein (PRO) cDNA #261. 

PN US2003036146-A1 . 

PD 20-FEB-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 8; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 91 

ID ACA05960 standard; cDNA; 2974 BP. 

DE Human secreted/ transmembrane protein (PRO) cDNA #261. 

PN US2003036162-A1. 

PD 20-FEB-2003. 

PA (GETH ) GENENTECH INC. 



Score 143.2; DB 8; 
Pred. No. 5.7e-31; 



Length 2974; 



Query Match 6.2%; 
Best Local Similarity 48.3%; 
RESULT 92 

ID ACA66794 standard; cDNA; 2974 BP. 

DE cDNA encoding human PRO protein #261 

PN US2003036137-A1. 

PD 20-FEB-2003. 

PA (GETH ) GENENTECH INC. 



Score 143.2; DB 8; Length 2974; 
Pred. No. 5.7e-31; 



Score 143.2; DB 8; Length 2974; 
Pred. No. 5.7e-31; 



SEQ ID NO: 521, 



Query Match .. 6.2%; 

Best Local Similarity 48.3%; 
RESULT 93 

ID ACF20369 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003040063-A1. 
PD 27-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 8; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 94 

ID ACF19755 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 



Length 2974; 



SEQ ID NO: 521. 



PN US2003040064-A1. 
PD 27-FEB-2003.^ 

Query Match 6.2%; Score 143.2; DB 8; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 95 

ID ACD22043 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261. 
PN US2003027267-A1. 
PD 06-FEB-2003. 

Query Match. 6.2%; Score 143.2; DB 8; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 96 

ID ACF13208 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521. 
PN US2003036160-A1. 
PD 20-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 8; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 97 

ID ACD25311 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261. 
PN US2003044925-A1. 
PD 06-MAR-2003. 

Query Match 6.2%; Score 143.2; DB 8; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 98 

ID ACF00360 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521. 

PN US2003054474-A1. 

PD 20-MAR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 8; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 99 

ID ACA72417 standard; cDNA; 2974 BP. 

DE Novel human secreted and transmembrane protein PR06239 cDNA. 
PN US2003032114-A1. 
PD 13-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 8; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 100 

ID ACD04941 standard; cDNA; 2974 BP. 

DE Novel human secreted and transmembrane protein PR06239 cDNA. 
PN US2003032101-A1. 
PD 13-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 8; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 101 

ID ACD18402 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261. 
PN US2003036124-A1. 
PD 20-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 8; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5-7e-31; 
RESULT 102 

ID ACD08409 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261. 



PN US2003040054-A1. 
PD 27-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 8; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 103 

ID ACA88843 standard; cDNA; 2974 BP. 

DE Novel human secreted and transmembrane protein PR06239 cDNA. 
PN US2003036133-A1 . 
PD 20-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 8; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 104 

ID ACA70285 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261. 
PN US2003036134-A1. 
PD 20-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 8; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 105 

ID ACD12507 standard; cDNA; 2974 BP. 

DE Novel human secreted and transmembrane protein PR06239 cDNA. 
PN US2003022294-A1 . 
PD 30-JAN-2003. 

Query Match 6.2%; Score 143.2; DB 8; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 106 

ID ACC74422 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521. 
PN US2003027275-A1 . 
PD 06-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 8; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 107 

ID ACD16050 standard; cDNA; 2974 BP. 

DE Human' secreted/transmembrane protein (PRO) cDNA #261. 
PN US2003027324-A1. 
PD 06-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 8; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 108 

ID ACD25618 standard; cDNA; 2974 BP. 

DE Novel human secreted and transmembrane protein PR06239 cDNA. 

PN US2003036118-A1 . 

PD 20-FEB-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score .143.2; DB 8; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 109 

ID ACD18095 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261- 
PN US2003036123-A1. 
PD 20-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 8; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 110 

ID ACC88382 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521. 



PN US2003036148-A1. 
PD 20-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 8; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 111 

ID ACD21736 standard; cDNA; 2974 BP. 

DE Human secreted/ transmembrane protein (PRO) cDNA #261. 
PN US2003040060-A1. 
PD 27-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 8; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 112 

ID ACD18803 standard; cDNA; 2974 BP. 

DE Human secreted/ transmembrane protein (PRO) cDNA #261. 
PN US2003044916-A1. 
PD 06-MAR-2003. 

Query Match 6.2%; Score 143.2; . DB 8; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 113 

ID ABX98413 standard; cDNA; 2974 BP. 

DE Human cDNA encoding a secreted/transmembrane protein^ SEQ ID 521. 
PN US2003036156-A1. 
PD 20-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 8; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 114 

ID ACD14164 standard; cDNA; 2974 BP. 
DE Human PRO polynucleotide #261. 
PN US2003032117-A1. 
PD 13-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 8; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 115 

ID ACD09944 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261. 
PN US2003036128-A1. 
PD 20-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 8; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 116 

ID ACC88689 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521. 
PN US2003027266-A1 . 
PD 06-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 8; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 117 

ID ACD21429 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261. 

PN US2003054483-A1. 

PD 20-MAR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 8; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 118 

ID ABX75801 standard; cDNA; 2974 BP. 

DE Human cDNA encoding secreted/transmembrane protein, PR06239. 



2; DB 8; 
5.7e-31; 



Length 2974; 



2; DB 8; 
5.7e-31; 



Length 2974; 



2; DB 8; 
5*7e-31; 



Length 2974; 



2; DB 8; 
5.7e-31; 



Length 2974; 



PN US2003022298-A1. 
PD 30-JAN-2003. 

Query Match 6.2%; Score 143, 

Best Local Similarity 48.3%; Pred. No. 
RESULT 119 

ID ABX98004 standard; cDNA; 2974 BP. 
DE Human PRO polynucleotide #261. 
PN US2003032102-A1. 
PD 13-FEB-2003. 

Query Match 6.2%; Score 143. 

Best Local Similarity 48.3%; Pred. No. 
RESULT 120 

ID ACA97480 standard; cDNA; 2974 BP. 

DE Novel human secreted and transmembrane protein PR06239 cDNA. 
PN US2003036117-A1. 
PD 20-FEB-2003. 

Query Match 6.2%; Score 143. 

Best Local Similarity 48.3%; Pred. No. 
RESULT 121 

ID ACA57943 standard; cDNA; 2974 BP. 
DE Human PR06239 cDNA. 
PN US2003036143-A1. 
PD 20-FEB-2003. 

Query Match 6.2%; Score 143. 

Best Local Similarity 48.3%; Pred. No. 
RESULT 122 

ID ACD14471 standard; cDNA; 2974 BP. . 
DE Human PRO polynucleotide #261. 
PN US2003032130-A1 . 
PD 13-FEB-2003. 

Query Match 6.2%; Score 143. 

Best Local Similarity 48.3%; Pred. No. 
RESULT 123 

ID ACC91254 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521. 

PN US2003032138-A1. 

PD 13-FEB-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB'8; 

Best Local Similarity 48.3%; Pred, No. 5.7e-31; 
RESULT 124 

ID ACC88996 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003036132-A1. 
PD 20-FEB-2003. 

Query. Match 6.2%; Score 143. 

Best Local Similarity 48.3%; Pred. No. 
RESULT 125 

ID ACD07193 standard; cDNA; 2974 BP. 
DE Human PRO polynucleotide #261. 
PN US20030083.53-A1 . 
PD 09-JAN-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 

Best Local Similarity 48.3%; Pred. 
RESULT 126 

ID ACA67644 standard; cDNA; 2974 BP. 



2; DB 8; 
5.7e-31; 



Length 2974; 



2; DB 8; 
5.7e-31; 



Length 2974; 



SEQ ID NO: 521, 



Length, 2974; 



143. 
No. 



2; DB 8; 
5.7e-31; 



Length 2974; 



Score 143.2; DB 8; 
Pred. No. 5.7e-31; 



Score 143.2; DB 8; 
Pred. No. 5.7e-31; 



143, 
No. 



2; DB 8; 
5.7e-31; 



DE Human PRO polynucleotide #261 
PN US2003017542-A1. 
PD 23-JAN-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 127 

ID ACC81699 standard; cDNA; 2974 BP. 
DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003032137-A1. 
PD 13-FEB-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 128 

ID ACC89303 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003027269-A1. 
PD 06-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 8; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 129 

ID ACC86659 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003027268-A1. 
PD 06-FEB-2003. 

Query Match 6.2%; Score 

Best Local Similarity 48.3%; Pred. 
RESULT 130 

ID ACC89917 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA^ 
PN US2003027274-A1. 
PD 06-FEB-2003. 

Query Match 6.2%; Score 

Best Local Similarity 48.3%; Pred. 
RESULT 131 

ID ACC93096 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR062 3 9- encoding " cDNA, 
PN US2003032135-A1. 
PD 13-FEB-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 8; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 132 

ID ACA72724 standard; cDNA; 2974 BP. 
DE Human PRO polynucleotide #261. 
PN US2003022295-A1. 
PD 30-JAN-2003. 

Query Match 6.2%; Score 

Best Local Similarity 48.3%; Pred. 
RESULT 133 

ID ACA89242 standard; cDNA; 2974 BP. 

DE Human secreted/ transmembrane protein (PRO) cDNA #261. 

PN US2003022297-A1. 

PD 30-JAN-2003. 

Query Match 6.2%; Score 143.2; DB 8; Length 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 

RESULT 134 

ID ACA69978 standard; cDNA; 2974 BP. 



143. 
No. 



2; DB 8; 
5.7e-31; 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



143.2; DB 8; 
No. 5.7e-31; 



Length 2974; 



2974; 



DE Human secreted/transmembrane protein (PRO) cDNA #261. 
PN US2003032105-A1. 
PD 13-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 8; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 135 

ID ACA97121 standard; cDNA; 2974 BP. 

DE Novel hiiman secreted and transmembrane protein PR06239 cDNA. 
PN US2003032123-A1. 
PD 13-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 8; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 136 

ID ACA91117 standard; cDNA; 2974 BP. 

DE Novel human secreted and transmembrane protein PR06239 cDNA. 
PN US2003032108-A1. 
PD' 13-FEB-2003, 

Query Match 6.2%; Score 143.2; DB 8; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 137 

ID ACA70899 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261. 
PN US2003032111-A1. 
PD 13-FEB-2003. 

Query Match . 6,2%; Score 143.2; DB 8; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 138 

ID ACA95409 standard; cDNA; 2974 BP. 

DE Novel human secreted and transmembrane protein PR06239 cDNA. 

PN US2003032119-A1. 

PD 13-FEB-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 8; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 139 

ID ACC86352 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521. 
PN US2003027263-A1. 
PD 06-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 8; Length 2974; 

Best Local Similarity . 48.3%; Pred. No. 5.7e-31; 
RESULT 140 

ID ACC90224 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521. 
PN .US2003027271-A1. 
PD 06-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 8; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 141 

ID ACD12832 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261. 
PN US2003036125-A1 . 
PD 20-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 8; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 142 

ID ACF20062 standard; cDNA; 2974 BP. 



Score 143.2; DB 8; Length 2974; 
Pred. No. 5.7e-31; 



Score 143.2; DB 8; 
Pred. No. 5.7e-31; 



Length 2974; 



Score 143.2; DB 8; 
Pred. No. 5.7e-31; 



Length 2974; 



Score 143.2; DB 8; 
Pred. No. 5.7e-31; 



Length 2974; 



DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521, 
PN US2003040068-A1. 
PD 27-FEB-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 143 

ID ABX77006 standard; cDNA; 2974 BP. 
DE Human PRO polynucleotide #261. 
PN US2003027280-A1. 
PD 06-FEB-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 144 

ID ACA73338 standard; cDNA; 2974 BP. 

DE Novel human secreted and transmembrane protein PR06239 cDNA. 
PN US2003022300-A1. 
PD 30-JAN-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 145 

ID ACA68881 standard; cDNA; 2974 BP. 

DE Novel human secreted and transmembrane protein PR06239 cDNA. 
PN US2003036136-A1. 
PD 20-FEB-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 146 

ID ACA74725 standard; cDNA; 2974 BP. 
DE cDNA encoding human PRO polypeptide #261. 
PN US2003036138-A1 . 
PD 20-FEB~2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 147 

ID ACA70592 standard; cDNA; 2974 BP. 

DE Human secreted/ transmembrane protein (PRO) cDNA #261. 
PN US2003032109-A1. 
PD 13-FEB-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 148 

ID ACD14778 standard; cDNA; 2974 BP. 
DE Human PRO polynucleotide #261. 
PN US2003040066-A1. 
PD 27-FEB-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 149 

ID ACA68450 standard; cDNA; 2974 BP. 

DE Novel human secreted and transmembrane protein PR06239 cDNA. 
PN US2003032104-A1. 
PD 13-FEB-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 150 

ID ABX98915 standard; cDNA; 2974 BP. 

DE Novel human secreted and transmembrane protein PR06239 cDNA. 



Score 143.2; DB 8; 
Pred. No. 5.7e-31; 



Length 2974; 



Score 143.2; DB 8; 
Pred. No. 5.7e-31; 



Length 2974; 



Score. 143.2; DB 8; Length 2974; 
Pred. No. 5.7e-31; 



Score 143.2; DB 8; 
Pred. No. 5.7e-31; 



Length 2974; 



Score 143.2; DB 8; 
Pred. No. 5.7e-31; 



Length 2974; 



Score 143.2; DB 8; 
Pred. No. 5.7eT31; 



Length 2974; 



Score 143.2; DB 8; 
Pred. No. 5.7e-31; 



PN US2003036157-A1. 
PD 20-FEB-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 151 

ID ACC81392 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521. 
PN US2003032120-A1. 
PD 13-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 8; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 152 

ID ACA95716 standard; cDNA; 2974 BP. 

DE Novel human secreted and transmembrane protein PR06239 cDNA. 
PN US2003036155-A1 . 
PD 20-FEB-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 153 

ID ACD04634 standard; cDNA; 2974 BP. 

DE Novel human secreted and transmembrane protein PR06239 cDNA. 
PN US2003022296-A1. 
PD 30-JAN-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 154 

ID ACC88075 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003027281-A1. 
PD 06-FEB-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 8; 

Best Local Similarity 4,8.3%; Pred. No. 5.7e-31; 
RESULT 155 

ID ACF12737 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003040058-A1. 
PD 27-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 8; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 156 

ID ACA96452 standard; cDNA; 2974 BP. 
DE Human PRO polynucleotide #261. 
PN US2003017540-A1. 
PD 23-JAN-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 157 

ID ACA65226 standard; cDNA; 2974 BP. 
DE Human PRO polynucleotide #261. 
PN US2003032106-A1. 
PD 13-FEB-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 158 

ID ACA73952 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261. 



Length 2974; 



SEQ ID NO: 521. 



Length 2974; 



SEQ ID NO: 521. 



Length 2974; 



Score 143.2; ,.DB 8; 
Pred. No. 5.7e-31; 



Length 2974; 



Score 143.2; DB 8; Length 2974; 
Pred. No. 5.7e-31; 



Score 143.2; DB 8; 
Pred. No. 5.7e-31; 



Length 2974; 



Length 2974; 



Score 143.2; DB 8; 
Pred. No. 5.7e-31; 



PN US2003032129-A1. 
PD 13-FEB-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 159 

ID ACA74364 standard; cDNA; 2974 BP. 

DE Novel human secreted and transmembrane protein PR06239 cDNA. 
PN US2003032131-A1 . 
PD 13-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 8; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 160 

ID ACA96759 standard; cDNA; 2974 BP. 
DE Human PRO polynucleotide #261. 
PN US2003032103-A1. 
PD 13-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 8; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 161 

ID ACD10865 standard; cDNA; 2974 BP. 
DE cDNA encoding human PRO polypeptide #261. 
PN US2003032107-A1. 
PD 13-FEB-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 162 

ID ACC91561 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003032139-A1. 
PD 13-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 8; 

Best Local Similarity 48*3%; Pred. No. 5.7e-31; 
RESULT 163 

ID ACD02896 standard; cDNA; 2974 BP. 
DE cDNA encoding human PRO polypeptide #261. 
PN US2003022301-A1. 
PD 30-JAN-2003. 

Query Match 6.2%; Score 143.2; DB 8; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 164 

ID ACC87461 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003036165-A1. 
PD 20-FEB-2003. 
PA (GETH ) GENENTECH INC. 

Query Match ..... 6.2%; Score 

Best Local Similarity 48.3%; Pred. 
RESULT 165 

ID ACC86045 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521 
PN US2003027262-A1. 
PD 06-FEB-2003. 

Query Match 6.2%; Score 

Best Local Similarity 48.3%; Pred. 
RESULT 166 

ID ACA65533 standard; cDNA; 2974 BP. 
DE Human PRO polynucleotide #261. 



Length 297.4; 



SEQ ID NO: 521 



Length 2974; 



143, 
No. 



2; DB 8; 
5.7e-31; 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



143.2; DB 8; 
No. 5.76-31; 



Length 2974; 



Score 143.2; DB 8; 
Pred. No. 5.7e-31; 



PN US2003032110-A1. 
PD 13-FEB-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 167 

ID ACA94350 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261. 
PN US2003036142-A1. 
PD 20-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 8; 



Length 2974; 



Length 2974; 



48.3%; Pred. No. 5.7e-31; 



Length 2974; 



Score 143.2; DB 8; 
Pred. No. 5.7e-31; 



Length 2974; 



Score 143.2; DB 8; 
Pred. No. 5.7e-31; 



Length 2974; 



Best Local Similarity 
RESULT 168 

ID ACA98094 standard; cDNA; 2974 BP. 
DE Human PRO polynucleotide #261. 
PN US2003036145-A1. 
PD 20-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 8; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 169 

ID ACA91596 standard; cDNA; 2974 BP. 

DE Novel human secreted and transmembrane protein PR06239 cDNA. 
PN US2003036154-A1. 
PD 20-FEB-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 170 

ID ACA90810 standard; cDNA; 2974 BP. 

DE Novel human secreted and transmembrane protein PR06239 cDNA. 
PN US2003036153-A1 . 
PD 20-FEB-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 171 

ID ACD16357 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261. 
PN US2003044931-A1. 
PD 06-MAR-2003. 

Query Match 6.2%; Score 143.2; DB 8; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 172 

ID ACD17518 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261. 
PN US2003036150-A1. 
PD 20-FEB-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 173 

ID ACC92175 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US20(j3040069-Al. 
PD 27-FEB-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 174 

ID ACA75032 standard; cDNA; 2974 BP. 
DE cDNA encoding human PRO polypeptide #261. 
PN US2003022293-A1. 



Score 143.2; DB 8; 
Pred. ,No. 5.7e-31; 



Score 143.2; DB 8; 
Pred. No. 5.7e-31; 



Length 2974; 



SEQ ID NO: 521, 



Length 2974; 



Score 143.2; DB 8; 
Pred. No. 5.7e-31; 



PD 30-JAN-2003. 

Query Match 6.2%; 

Best Local Similarity 48,3%; 
RESULT 175 

ID ACA91903 standard; cDNA; 2974 BP. 
DE Human PRO polynucleotide #261. 
PN US2003032128-A1 . 
PD 13-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 8; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 176 . 

ID ACA71547 standard; cDNA; 2974 BP. 

DE Human secreted/ transmembrane protein (PRO) cDNA #261. 

PN US2003032116-A1. 

PD 13-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 8; Length 

Best Local Similarity 48.3%; Pred- No. 5.7e-31; 

RESULT 177 

ID ACC90947 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID 
PN US2003032122-A1. 
PD 13-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 8; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 178 

ID ACA65957 standard; cDNA; 2974 BP. 
DE cDNA encoding human PRO protein #261. 
PN US2003036139-A1. 
PD 20-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 8; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 179 

ID ACA95102 standard; cDNA; 2974 BP. 
DE cDNA encoding human PRO polypeptide #261. 
PN US2003017541-A1. 
PD 23-JAN-2003. 

Query Match 6.2%; Score 143.2; DB 8; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 180 

ID ACD16664 standard; cDNA; 2974 BP. 

DE Hximan secreted/ transmembrane protein (PRO) cDNA #261. 

PN US2003017543-A1. 

PD 23-JAN-2003. 

Query Match 6.2%; Score 143.2; DB 8; Length 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 

RESULT IBl 

ID ACD15743 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #2 61. 

PN US2003036152-A1. 

PD 20-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 8; Length 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 

RESULT 182 

ID ABX16846 standard; cDNA; 2974 BP. 

DE Human cDNA encoding secreted/transmembrane protein #261. 
PN US2002127584-A1. 
PD 12-SEP-2002. 



Length 2974; 



Length 2974; 



2974; 



NO: 521. 



Length 2974; 



Length 2974; 



Length 2974; 



2974; 



2974; 



Score 143.2; DB 8; Length 2974; 
Pred. No. 5.7e-31; 



DB 9; 
5.7e-31; 



Length 2974; 



6.2%; 
48.3%; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



Length 2974; 



SEQ ID NO: 521. 



Length 2974; 



PA (GETH ) GENENTECH INC. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 183 

ID ACA97787 standard; cDNA; 2974 BP. 
DE Human PRO polynucleotide #261. 
PN US2003032115-A1. 
PD 13-FEB-2003. 

Query Match 6.2%; Score 143.2; 

Best Local Similarity 48.3%; Pred. 
RESULT 184 

ID ACA99236 standard; cDNA; 2974 BP. 

DE Novel human secreted and transmembrane protein PR06239 cDNA. 
PN US2003032140-A1. 
PD 13-FEB-2003. 
Query Match 
Best Local Similarity 
RESULT 185 

ID ACC91868 standard; cDNA; 2974 BP. 

Human secreted polypeptide PR06239-encoding cDNA, 
US2003040076-A1. 
27-FEB-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 186 

ID ACD11279 standard; cDNA; 2974 BP. 

Novel human secreted and transmembrane protein PR06239 cDNA. 
US2003008352-A1. 
09-JAN-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 187 

ID ACD15129 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261. 
PN US2003044922-A1. 
PD 06-MAR-2003. 

Query Match 
Best Local Similarity 
RESULT 188 

ID ACD11893 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261. 
PN US2003032118-A1. 
PD 13-FEB-2003. 
Query Match 
Best Local Similarity 
RESULT 189 

ID ACC96022 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003036135-A1. 
PD 20-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 190 

ID ACF16585 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA^ 



6.2%; 
48.3%; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



Length 2974; 



6.2%; 
48.3%; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



Length..2974; 



SEQ ID NO: 521. 



Length 2974; 



SEQ ID NO: 521. 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



PN US2003054455-A1. 
PD 20-MAR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 191 

ID ACF02703 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 

PN US2003049741-A1. 

PD 13-MAR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 192 

ID ACF03010 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 

PN US2003049743-A1. 

PD 13-MAR-2003. 

PA (GETH ) GENENTECH INC. 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



SEQ ID NO: 521 



Score 143.2; DB 9; Length 2974; 



Pred. No. 5.7e-31; 



Query Match 6.2%; 
Best Local Similarity 48.3%; 
RESULT 193 

ID ACF21597 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 

PN US2003049769-A1. 

PD 13-MAR-2003. 

PA (GETH ) GENENTECH INC. 



SEQ ID NO: 521 



Score 143.2; DB 9; Length 2974; 



Pred. No. 5.7e-31; 



Query Match 6.2%; 
Best Local Similarity 48.3%; 
RESULT 194 

ID ACF10281 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 

PN US2003068743-A1. 

PD lO-APR-2003. 

PA (GETH ) GENENTECH INC. 



SEQ ID NO: 521 



Score 143.2; DB 9; Length 2974; 



Pred. No. 5.7e-31; 



Query Match 6.2%; 
Best Local Similarity 48.3%; 
RESULT 195 

ID ACF78174 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003054479-A1. 
PD 20-MAR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 196 ....... 

ID ACD46879 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261. 

PN US2003068685-A1. 

PD lO-APR-2003. 

PA (GETH ) GENENTECH INC. 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



SEQ ID NO: 521 



Length 2974; 



Query Match 6.2%; 
Best Local Similarity 48.3%; 
RESULT 197 

ID ACD49642 standard; cDNA; 2974 BP. 

DE Hiiman secreted/transmembrane protein (PRO) cDNA #2 
PN US2003068725-A1. 



Score 143.2; DB 9; Length 2974; 
Pred. No. 5.7e-31; 



61. 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



DE 
PN 
PD 
PA 



PD lO-APR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 198 

ID ACF28409 standard; cDNA; 2974 BP. 

Human secreted polypeptide PR06239-encoding cDNA, SEQ ID 
US2003068752-A1. 
lO-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; Length 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 199 

ID ACD89099 standard; cDNA; 2974 BP. 

Human secreted/transmembrane protein (PRO) cDNA #261. 
US2003068682-A1. 
lO-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; Length 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 200 

ID ACD84494 standard; cDNA; 2974 BP. 
Human PRO polynucleotide #261. 
US2003068701-A1. 
lO-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; Length 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 201 

ID ACD99268 standard; cDNA; 2974 BP. 

cDNA encoding human PRO polypeptide #261. 
US2003068755-A1. 
10-APR'-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; Length 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 202 

ID ADA78273 standard; cDNA; 2974 BP. 

Human secreted/transmembrane protein (PRO) cDNA #261- 
US2003073180-A1. 
17-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; Length 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 203 

ID ACF49010 standard; cDNA;.,2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID 

PN US2003104539-A1. 

PD 05-JUN-2003. 

Query Match 6.2%; Score 143.2; DB 9; Length 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 

RESULT 204 

ID ACD09330 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261. 
PN US2003036131-A1. 
PD 20-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 9; Length 



Length 2974; 



NO:521. 



2974; 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



2974; 



2974; 



2974; 



2974; 



NO:521, 



2974; 



2974; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



Score 143.2; DB 9; 
Pred. No. 5.7(e-31; 



Best Local Similarity 48.3%; Pred. No. 5.7e-31; 

RESULT 205 

ID ACF12123 standard; cDNA; 2974 BP. 
DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003040075-A1. 
PD 27-FEB-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 206 

ID ACF41357 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003054459-A1 . 
PD 20-MAR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 207 

ID ACF15971 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003044930-A1. 
PD 06-MAR-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 208 

ID ACF16278 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003040071-A1. 
PD 27-FEB-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 209 

ID ACD32105 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261 

PN US2003054471-A1. 

PD 20-MAR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 210 

ID ACF18913 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003064452-A1. 
PD 03-APR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score.v-143 . 2 ; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 211 

ID ACF09360 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003068705-A1. 
PD 10--APR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 212 

ID ACF78481 standard; cDNA; 2974 BP. 



SEQ ID NO: 521 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



DE Human secreted polypeptide PR06239 

PN US2003054473-A1. 

PD 20-MAR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 

Best Local Similarity 48.3%; Pred. 

RESULT 213 

ID ACF52080 standard; cDNA; 2974 BP. 
DE Human secreted polypeptide PR06239 
PN US2003064440-A1. 
PD 03-APR-2003. 

Query Match 6.2%; Score 

Best Local Similarity 48.3%; Pred. 
RESULT 214 

ID ACF26567 standard; cDNA; 2974 BP. 

Human secreted polypeptide PR06239- 
US2003068704-A1. 
lO-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 

Best Local Similarity 48.3%; Pred. 
RESULT 215 

ID ACF24360 standard; cDNA; 2974 BP. 

Human secreted polypeptide PR06239- 
US2003068722-A1. 
lO-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 

Best Local Similarity 48.3%; Pred. 
RESULT 216 

ID ACF63671 standard; cDNA; 2974 BP. 
Human secreted polypeptide PR06239 
US2003073183-A1. 
17-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 

Best Local Similarity 48.3%; Pred. 
RESULT 217 

ID ACF50545 standard; cDNA; 2974 BP. 

Human secreted polypeptide PR06239- 
US2003104549-A1. 
05-JUN-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 

Best Local Similarity 48.3%; Pred. 
RESULT 218 

ID ACH08016 standard; cDNA; 2974 BP. 

Hxaman secreted/transmembrane protei 
US2003049749-A1. 
13-MAR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 

Best Local Similarity 48.3%; Pred. 
RESULT 219 

ID .ACF13822 standard; cDNA; 2974 BP. 
DE Human secreted polypeptide PR06239 
PN US2003064462-A1. 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



-encoding cDNA, SEQ ID NO: 521 



143.2; DB 9; Length 2974; 
No. 5.7e-31; 



-encoding cDNA, SEQ ID NO: 521 



143.2; DB 9; Length 2974; 
No. 5.7e-31; 



encoding cDNA, SEQ ID NO: 521 



143.2; DB 9; Length 2974; 
No. 5.7e-31; 



encoding cDNA, SEQ ID NO: 521 



143.2; DB 9; Length 2974; 
No. 5.7e-31; 



encoding cDNA, SEQ ID NO: 521 



143.2; DB 9; Length 2974; 
No. 5.7e-31; 



encoding cDNA, SEQ ID NO: 521 



143.2; DB 9; Length 2974; 
No. 5.7e-31; 



n (PRO) cDNA #261. 



143.2; . DB 9; Length 2974; 
No. 5.7e-31; 



encoding cDNA, SEQ ID NO: 521 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



DE 
PN 
PD 
PA 



cDNA #261. 



DE 
PN 
PD 
PA 



PD 03-APR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 220 

ID ACD41748 standard; cDNA; 2974 BP. 

Human secreted/transmembrane protein (PRO) 
US2003065159-A1. 
03-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 221 

ID ACF32161 standard; cDNA; 2974 BP. 

Human secreted polypeptide PR06239-encoding cDNA, 
US2003064447-A1. 
03-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 222 

ID ACF23439 standard; cDNA; 297 4 BP. 

Human secreted polypeptide PR06239-encoding cDNA, 
US2003073184-A1. 
17-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 223 

ID ACF40129 standard; cDNA; 2974 BP. 

Human secreted polypeptide PR06239-encoding cDNA, 
US2003064463-A1. 
03-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 224 

ID ACD45651 standard; cDNA; 2974 BP. 

Human secreted/transmembrane protein (PRO) cDNA #261 
US2003064451-A1. 
03-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 225 

ID ACF53308 standard; cDNA; 2974 BP. 

Human secreted polypeptide PR06239-encoding cDNA, 
US2003068721-A1. 
lO-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 226 

ID ACF27488 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003068699-A1. 
PD lO-APR-2003. 



Length 2974; 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



Length 2974; 



SEQ ID NO: 521. 



Length 2974; 



SEQ ID NO: 521. 



Length 2974; 



SEQ ID NO: 521, 



Length 2974; 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



Length 2974; 



SEQ ID NO: 521, 



Length 2974; 



SEQ ID NO: 521. 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



PA (GETH ) GENENTECH INC. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 227 

ID ACF45326 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA^ 
PN US2003068707-A1. 
PD lO-APR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 228 

ID ACF29944 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003073175-A1. 
PD • 17-APR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31;. 
RESULT 229 

ID ACD90020 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261. 

PN US2003068695-A1 . 

PD lO-APR-2003. 

PA (GETH ) GENENTECH INC. 

Score 143.2; DB 9; Length 2974; 



Length 2974; 



SEQ ID NO: 521. 



Length 2974; 



SEQ ID NO: 521, 



Length 2974; 



Pred. No. 5.7e-31; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



Query Match 6.2%; 
Best Local Similarity 48.3%; 
RESULT 230 

ID ACD84801 standard; cDNA; 2974 BP. 
DE Human PRO polynucleotide #261. 
PN US2003068703-A1. 
PD lO-APR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 231 

ID ACD98961 standard; cDNA; 2974 BP. 

DE cDNA encoding human PRO polypeptide #261. 

PN US2003068732-A1 . 

PD lO-APR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 232 

ID ACF77253 standard; cDNA; 2974 BP. 

DE Human secreted .polypeptide PR06239-encoding cDNA, 

PN US2003082717-A1. 

PD Ol-MAY-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 233 

ID ACF76946 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 

PN US2003104548-A1. 

PD 05-JUN-2003. 

PA (GETH ) GENENTECH INC. 



Length 2974; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



Length 2974; 



SEQ ID NO: 521. 



Length 2974; 



SEQ ID NO: 521, 



Query Match 



6.2%; Score 143.2; DB 9; Length 2974; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



(PRO) cDNA #261. 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



Best Local Sindlarity 48.3%; Pred. No. 5.7e-31; 

RESULT 234 

ID ACF49931 standard; cDNA; 2974 BP. 
DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003104542-A1. 
PD 05-JUN-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 235 

ID ACF50238 standard; cDNA; 2974 BP. 
DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003104543-A1. 
PD 05-JUN-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 236 

ID ACD09637 standard; cDNA; 2974 BP. 
DE Human secreted/transmembrane protein 
PN US2003036127-A1. 
PD 20-FEB-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 237 

ID ACD08716 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261 
PN US2003040061-A1. 
PD 27-FEB-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 238 

ID ACF12430 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US20.03036130-A1. 
PD 20-FEB-2003. 

Query Match 6.2%;. 

Best Local Similarity 48.3%; 
RESULT 239 

ID ACC94938 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003054468-A1. 
PD 20-MAR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 240 

ID ACD22657 standard; cDNA;.:.2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261 

PN US2003054470-A1. 

PD 20-MAR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred- No. 5.7e-31; 
RESULT "^241 

ID ACF15357 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003044917-A1. 
PD 06-MAR-2003. 



SEQ ID NO: 521 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



Length 2974; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



Score 143.2; DB 9; 
Pred- No. 5.7e-31; 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



Length 2974; 



SEQ ID NO: 521 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



Query Match 6.2%; 
Best Local Similarity - 48.3%; 
RESULT 242 

ID ACC97452 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA^ 
PN US2003044929-A1. 
PD 06-MAR-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 243 

ID ACC92482 standard; cDNA; 2974 BP- 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US20030598B0-A1 . 
PD 27-MAR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e~31; 
RESULT 244 

ID ACF14129 standard; cDNA; 2974 BP. 

Human secreted polypeptide PR06239-encoding cDNA, SEQ ID 
US2003064465-A1. 
03-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 245 

ID ACF14436 standard; cDNA; 2974 BP. 

Human secreted polypeptide PR06239-encoding cDNA, SEQ ID 
US2003054478-A1. 
20~MAR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; Length 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 246 

ID ACF09667 standard; cDNA; 2974 BP. 

Human secreted polypeptide PR06239-encoding cDNA, SEQ ID 
US2003068718-A1. 
lO-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; Length 

Best Local Similarity 48.3%; Pred. No. 5.7e~31; 
RESULT 247 

ID ACD45958 standard; cDNA; 2974 BP. 

Human secreted/transmembrane protein (PRO) cDNA #261. 
US2003064454-A1. 
03-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; Length 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 248 

ID ACD48107 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261. 

PN US2003064461-A1. 

PD 03~APR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; Length 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 



Length 2974; 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



SEQ ID NO:521. 



Length 2974; 



SEQ ID NO: 521, 



Length 2974; 



NO:521, 



Length 2974; 



NO: 521, 



2974; 



NO: 521. 



2974; 



2974; 



2974; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



RESULT 249 

ID ACD67838 standard; cDNA; 2974 BP. 
DE cDNA encoding human PRO polypeptide #261. 
PN US2003068724-A1. 
PD lO-APR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 250 

ID ACF25646 standard; cDNA; 2974 BP. 
DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003068727-A1. 
PD lO-APR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 251 

ID ACF29330 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003068772-A1. 
PD lO-APR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 252 

ID ACD85108 standard; cDNA; 2974 BP. 

Human secreted/ transmembrane protein (PRO) cDNA #261. 
US2003068714-A1. 
10~APR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 253 

ID ACD84187 standard; cDNA; 2974 BP. 
Human PRO polynucleotide #261. 
US2003068758-A1. 
lO-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 254 

ID ACD88178 standard; cDNA; 2974 BP. 

Human secreted/transmembrane protein (PRO) cDNA #261. 
US2003068776-A1. 
lO-APR-2003. 

(GETH ) GENENTECH INC. . . 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 255 

ID ACF30865 standard; cDNA; 2974 BP. 

Human secreted polypeptide PR06239-encoding cDNA, 
US2003069407-A1. 
lO-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 256 



Length 2974; 



SEQ ID NO: 521. 



Length 2974; 



SEQ ID NO: 521. 



Length 2974; 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



DE 
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PA 



DE 
PN 
PD 
PA 



Length 2974; 



Length 2974; 



Length 2974; 



SEQ ID NO:521. 



Length 2974; 



SEQ ID NO: 521. 



Length 2974; 



Length 2974; 



Length 2974; 



ID ACF32468 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA^ 
PN US2003104555-A1. 
PD 05-JUN-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 257 

ID ACH12128 standard; cDNA; 2974 BP. 
DE cDNA encoding human PRO polypeptide #261. 
PN US2003049768-A1. 
PD 13-MAR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 258 

ID ACH12435 standard; cDNA; 2974 BP. 
DE cDNA encoding human PRO polypeptide #261. 
PN US2003049771-A1. 
PD 13-MAR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 259 

ID ACD40827 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261, 
PN US2003032134-A1. 
PD 13-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 260 

ID ACF18299 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003054481-A1. 
PD 20-MAR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 261 

ID ACF08746 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521. 

PN US2003049778-A1. 

PD 13-MAR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; Length 2974; 

Best..Local Similarity 48.3%; Pred. No. 5.7e-31; ..... 
RESULT 262 

ID ACF31547 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521. 

PN US2003049782-A1. 

PD 13-MAR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; Length 2974; 

Best Local Similarity 48.3%; Pred- No. 5.7e-31; 
RESULT 263 

ID ACF52387 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA^ SEQ ID NO: 521. 



Length 2974; 



SEQ ID NO: 521. 



Length 2974; 



6.2%; Score 143.2; DB 9; 
48.3%; Pred. No. 5.7e-31; 



Length 2974; 



Length 2974; 



Length 2974; 



PN US2003054476-A1. 
PD . 20-MAR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 

Best Local Similarity 
RESULT 264 

ID ACD50256 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261 
PN US2003068733-A1. 
PD lO-APR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 265 

ID ACF38959 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521. 

PN US2003068692-A1. 

PD 10~APR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best^ Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 266 

ID ACF26874 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA^ 
PN US2003068709-A1. 
PD lO-APR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 267 

ID ACF24974 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003068716-A1. 
PD lO-APR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 268 

ID ACF46554 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521, 

PN US2003068740-A1. 

PD lO-APR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 14 3.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 269 ...... 



SEQ ID NO: 521, 



Length 2974; 



SEQ ID NO: 521. 



Length 2974; 



Length 2974; 



ID ACF28102 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003068751-A1. 
PD lO-APR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2.; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 270 

ID ACD89406 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261 
PN US2003068684-A1. 



SEQ ID NO: 521. 



Length 2974; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



PD lO-APR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 271 

ID ACF63978 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 

PN US2003073179-A1. 

PD 17-APR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 272 

ID ACF60618 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003087374~A1. 
PD 08-MAY-2003. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 273 

ID ACH12742 standard; cDNA; 2974 BP. 

DE cDNA encoding human PRO polypeptide #261- 

PN US2003049773-A1. 

PD 13-MAR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 274 

ID ACH10165 standard; cDNA; 2974 BP. 

DE Human secreted/ transmembrane protein (PRO) cDNA #2 

PN US2003049777-A1. 

PD 13-MAR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 275 

ID ACD04020 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #2 
PN US2003040055-A1. 
PD 27-FEB-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 276 

ID ACD10558 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #2 
PN US2003036164-A1. 
PD 20-FEB-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 277 

ID ACF42585 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 

PN US2003054480-A1. 

PD 20-MAR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 



Length 2974; 

SEQ ID NO: 521. 

Length 2974; 

SEQ ID NO: 521. 
Length 2974; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



Length 2974; 



61, 



Length 2974; 



61. 



Length 2974; 



61. 



Length 2974; 



SEQ ID NO: 521. 



Length 2974; 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



RESULT 278 

ID ACF18606 standard; cDNA; 2974 BP. 

Human secreted polypeptide PR06239-encoding cDNA, 
US2003059885-A1. 
27-MAR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 279 

ID ACF02396 standard; cDNA; 2974 BP. 

Human secreted polypeptide PR06239-encoding cDNA, 
US2003049740-A1. 
13-MAR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 280 

ID ACF21904 standard; cDNA; 2974 BP. 

Human secreted polypeptide PR06239-encoding cDNA, 
US2003049770-A1. 
13-MAR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 281 

ID ACF10588 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA^ 
PN US2003073169-A1. 
PD 17-APR-2003. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 282 

ID ACF34040 standard; cDNA; 2974 BP. 

Human secreted polypeptide PR06239-encoding cDNA, 
US2003064457-A1. 
03-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 283 

ID ACF45002 standard; cDNA; 2974 BP. 

Human secreted polypeptide PR06239-encoding cDNA, 
US2003068711-A1. 
lO-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5-7e-31; 
RESULT 284 

ID ACD90634 standard; cDNA; 2974 BP. 

Human secreted/transmembrane protein (PRO) cDNA #2 
US2003049745-A1. 
13-MAR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 285 

ID ACD91247 standard; cDNA; 2974 BP. 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



SEQ ID NO: 521 

Length 2974; 

SEQ ID NO: 521 

Length 2974; 

SEQ ID NO: 521 

Length 2974; 

SEQ ID NO: 521 
Length 2974; 

SEQ ID NO: 521 

Length 2974; 

SEQ ID NO: 521 

Length 2974; 

61. 

Length 2974; 



DE Human secreted/transmembrane protein (PRO) cDNA #261. 

PN US2003049751-A1. 

PD 13-MAR-2003. 

PA (GETH ) GENENTECH INC. 



Length 2974; 



Length 2974; 



Score 143.2; DB 9; Length 2974; 
Pred. No. 5.7e-31; 



Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 286 

ID ACF30558 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521 

PN US2003067478-A1. 

PD lO-APR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 287 

ID ACD87257 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261. 
PN US2003068773-A1 . 
PD lO-APR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 288 

ID ACF60311 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521 

PN US2003073185-A1 . 

PD 17-APR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 289 

ID ACF46861 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521 
PN US2003087373-A1. 
PD 08-MAY-2003. 

Query Match 6.2%; Score 143.2; DB 9; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 290 

ID ACF75718 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521 
PN US2003096353-A1. 
PD 22-MAY-2003. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 291 

ID ADA80065 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261. 
PN US2003073173-A1. 
PD 17-APR-2003. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 292 

ID ACF17378 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA^ SEQ ID NO: 521 
PN US2003054458-A1. 
PD 20-MAR-2003. 

Query Match 6.2%; Score 143.2; DB 9; Length 2974; 



Length 2974; 



Length 2974; 



Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 293 

ID ACF23132 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 

PN US2003059886-A1. 

PD 27-MAR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 294 

ID ACF08132 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 

PN US2003049758-A1. 

PD 13-MAR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5,7e-31; 
RESULT 295 

ID ACF08439 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 

PN US2003049772-A1. 

PD 13-MAR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 296 

ID ACF40743 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR062 3 9- encoding cDNA, 

PN US2003064448-A1. 

PD 03-APR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 297 

ID ACF53922 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 

PN US2003064456-A1. 

PD 03-APR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 298 

ID ACD47186 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #2 

PN US2003068693-A1. 

PD IOtAPR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 299 

ID ACF48089 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 

PN US2003068735-A1 . 

PD lO-APR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 



SEQ ID NO: 521 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



61. 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



Length 2974; 



SEQ ID NO: 521. 



Length 2974; 



RESULT 300 

ID ACF47475 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521. 

FN US2003068753-A1. 

PD lO-APR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 301 

ID ACF46247 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003068742-A1 . 
PD lO-APR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 302 

ID ACD86336 standard; cDNA; 2974 BP. 

DE Human secreted/ transmembrane protein (PRO) cDNA #261, 
PN US2003068756-A1. 
PD lO-APR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 303 

ID ACF52694 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA> 
PN US2003082715-A1. 
PD Ol-MAY-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 304 

ID ACF53001 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003082716-A1. 
PD Ol-MAY-2003. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 305 

ID ACF64 994 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003068737-A1. 
PD lO-APR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 306 

ID ACF76639 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003104547-A1. 
PD 05-JUN-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 307 

ID ACF61539 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521. 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



Length 2974; 



SEQ ID NO: 521. 



Length 2974; 



SEQ ID NO: 521, 



Length 2974; 



SEQ ID NO: 521. 



Length 2974; 



SEQ ID NO: 521, 



Length 2974; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



DE 
PN 
PD 
PA 



6.2%; 
48.3%; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



6.2%; 
48.3%; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



PN US2003096359-A1 . 
PD 22-MAY-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 308 

ID ACF61846 standard; cDNA; 2974 BP. 

Human secreted polypeptide PR06239-encoding cDNA, 
US2003100061-A1. 
29-MAY-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 309 

ID ACD30877 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261 
PN US2003032125-A1. 
PD 13-FEB-2003. 

Query Match 

Best Local Similarity 
RESULT 310 

ID ACD31798 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261. 
PN US2003054454-A1. 
PD 20-MAR-2003. 

Query Match 

Best Local Similarity 
RESULT 311 

ID ACD32719 standard; cDNA; 2974 BP. 

Human secreted/transmembrane protein (PRO) cDNA #261. 
US2003054477-A1. 
20-MAR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 312 

ID ACF17685 standard; cDNA; 2974 BP. 

Human secreted polypeptide PR06239-encoding cDNA, 
US2003054460-A1. 
20-MAR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 313 

ID ACF07518 standard; cDNA; 2974 BP. 

Human secreted polypeptide PR06239-encoding cDNA, 
US2003049753-A1. 
13-MAR-2003. - 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 314 

ID ACF20676 standard; cDNA; 2974 BP. 

Human secreted polypeptide PR06239-encoding cDNA, 
US2003049763-A1. 
13-MAR-2003. 
(GETH ) GENENTECH INC. 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



Length 2974; 



Length 2974; 



DE 
PN 
PD 

PA 



DE 
PN 
PD 

PA 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



SEQ ID NO: 521 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



Query Match 6.2%; 
Best Local Similarity 48.3%; 
RESULT 315 

ID ACF21290 standard; cDNA; 2974 BP. 

DE Hximan secreted polypeptide PR06239-encoding cDNA, 
PN US2003073172-A1. 
PD 17-APR-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 316 

ID ACF20983 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003073172-A1. 
PD 17-APR-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 317 

ID ACD47800 standard; cDNA; 2974 BP. 

Human secreted/transmembrane protein (PRO) cDNA #261 
US2003068700-A1. 
lO-APR-2003. 
(GETH ) GENENTECH INC. 
Query. Match 6.2%; * Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 318 

ID ACF47782 standard; cDNA; 2974 BP. 

Human secreted polypeptide PR06239-encoding cDNA, 
US2003068736-A1. 
lO-APR-2003. 
(GETH ) GENENTECH. INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; . Pred. No. 5.7e-31; 
RESULT 319 

ID ACF53615 standard; cDNA; 2974 BP. 

Human secreted polypeptide PR06239-encoding cDNA, 
US2003068679-A1. 
lO-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48i.3%; Pred. No. 5.7e-31; 
RESULT 320 

ID ACD86950 standard; cDNA; 2974 BP. 

Human secreted/transmembrane protein (PRO) cDNA #261 
US2003068767-A1. 
lO-APR-2003. 

(GETH ) GENENTECH INC. ..... 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 

RESULT 321 ' 

ID ACH05198 standard; cDNA; 2974 BP. 

cDNA encoding human PRO polypeptide #261. 
US2003073182-A1. 
17-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 322 



Length 2974; 



SEQ ID NO: 521. 



Length 2974; 



SEQ ID NO: 521. 



Length 2974; 



DE 

PN 
PD 
PA 



DE 
PN 
PD 
PA 



DE 
PN 
PD 

PA 



DE 
PN 
PD 

.PA 



DE 
PN 
PD 
PA 



Length 2974; 



SEQ ID NO: 521. 



Length 2974; 



SEQ ID NO: 521. 



Length 2974; 



Length 2974; 



Length 2974; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



Length 2974; 



Length 2974; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



Length 2974; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



Length 2974; 



ID ACF44695 standard; cDNA; 2974 BP. ' 

DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521. 

PN US2003104557-A1. 

PD 05-JUN-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 323 

ID ADA81792 standard; cDNA; 2974 BP. 

DE Human secreted/ transmembrane protein (PRO) cDNA #261. 
PN US2003092121-A1. 
PD 15-MAY-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 324 

ID ACD22350 standard; cDNA; 2974 BP. 

DE Human secreted/ transmembrane protein (PRO) cDNA #261, 
PN US2003027276-A1 . 
PD 06-FEB-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 325 

ID ACD24697 standard; cDNA; 2974 BP. 

DE Human secreted/ transmembrane protein (PRO) cDNA #261. 
PN US2003044920-A1. 
PD 06-MAR-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 326 

ID ACD39900 standard; cDNA; 2974 BP. 
DE cDNA encoding human PRO polypeptide #261. 
PN US2003027265-A1. 
PD 06-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 327 

ID ACD40207 standard; cDNA; 2974 BP. 
DE cDNA encoding human PRO polypeptide #261. 
PN US2003054461-A1. 
PD 20-MAR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 328 

ID ACF13515 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521. 

PN US2003064446-A1. 

PD 03-APR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 329 

ID ACF03317 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521. 
PN US2003049744-A1. 
PD 13-MAR-2003. 



Length 2974; 



Length 2974; 



Score 143.2; DB 9; 
Pred- No. 5.7e-31; 



Score 14 3.2; DB 9; 
Pred. No. 5.7e-31; 



PA (GETH ) GENENTECH INC. 

Query Match - 6.2%; 

Best Local Similarity 48.3%; 
RESULT 330 

ID ACF78788 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 

PN US2003049783-A1. 

PD 13-MAR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 331 

ID ACF11509 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR062 3 9- encoding cDNA, 
PN US2003073171-A1. 
PD 17-APR-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 332 

ID ACF50852 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003032121-A1. 
PD 13-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 333 

ID ACF34347 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 

PN US2003064458~A1. 

PD 03-APR-2003. . 

PA (GETH ) GENENTECH INC. 

Query Match . 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No, 5.7e-31; 
RESULT 334 

ID ACD46572 standard; cDNA; 2974 BP. 

DE Human secreted/ transmembrane protein (PRO) cDNA #2 

PN US2003064460-A1. 

PD 03-APR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 335 

ID ACD48414 standard; cDNA; 2974 BP. 

DE Human secreted/ transmembrane protein (PRO) cDNA #2 

PN US2003064464-A1. 

PD 03-APR-2003. ....... 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 336 

ID ACF27795 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 

PN US2003068702-A1. 

PD lO-APR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

' Best Local Similarity 48.3%; Pred. No. 5.7e-31; 



Length 2974; 

SEQ ID NO: 521 

Length 2974; 

SEQ ID NO: 521 
Length 2974; 

SEQ ID NO: 521 
Length 2974; 

SEQ ID NO: 521 

Length 2974; 

61. 

Length 2974; 
61. 

Length 2974; 
SEQ ID NO: 521 
Length 2974; 



RESULT 337 

ID ACF24667 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521. 

PN US2003068734-A1 . 

PD lO-APR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e~31; 
RESULT 338 

ID ACD85722 standard; cDNA; 2974 BP. 
DE Human secreted/transmembrane protein (PRO) 
PN US2003068719-A1. 
PD lO-APR-2003. 
PA (GETH ) GENENTECH INC. 



Length 2974; 



cDNA #261, 



Query Match 6.2%; 
Best Local Similarity 48.3%; 
RESULT 339 

ID ACD90327 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) 

PN US2003068729-A1. 

PD lO-APR-2003. 

PA (GETH ) GENENTECH INC. 



Score 143.2; DB 9; Length 2974; 
Pred. No. 5.7e-31; 



cDNA #261. 



Query Match 6.2%; 
Best Local Similarity 48.3%; 
RESULT 340 

ID ACD83880 standard; . cDNA; 2974 BP 

DE Human PRO polynucleotide #261. 

PN US2003068738-A1. 

PD lO-APR-2003. 

PA (GETH ) GENENTECH INC. 



Score 143.2; DB 9; Length 2974; 



Pred. No. 5.7e-31; 



6.2%; 
48.3%; 



Score 143.2; DB 9; Length 2974; 



Pred. No. 5.7e-31; 



6.2%; 
48.3%; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



cDNA #261. 



Query Match 
Best Local Similarity 

RESULT 341 

ID ACF49317 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003104540-A1. 
PD 05-JUN-2003. 

Query Match 

Best Local Similarity 
RESULT 342 

ID ACH07402 standard; cDNA; 2974 BP. 
DE Human secreted/transmembrane protein (PRO) 
PN US2003049742-A1. 
PD 13-MAR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 343 

ID ACH07709 standard; cDNA; 2974 BP. 
DE Human secreted/transmembrane protein (PRO) 
PN US2003049747-A1. 
PD 13-MAR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 344 

ID ACH08323 standard; cDNA; 2974 BP. 



SEQ ID NO: 521. 



Length 2974; 



Score. 143.2; DB 9; 
Pred. No. 5.7e-31; 



Length 2974; 



cDNA #261. 



Length 2974; 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



DE Human secreted/ transmembrane protein (PRO) cDNA #261 

PN US2003049750-A1. 

PD 13-MAR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5-7e-31; 
RESULT 345 

ID ACH11514 standard; cDNA; 2974 BP. 

cDNA encoding human PRO polypeptide #261. 
US2003049766-A1. 
13-MAR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 346 

ID ACH11821 standard; cDNA; 297,4 BP. 

cDNA encoding human PRO polypeptide #261. 
US2003049767-A1. 
13-MAR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 347 

ID ACH10472 standard; cDNA; 2974 BP. 

Human secreted/transmembrane protein (PRO) cDNA #261. 
US2003049779-A1. 
13-MAR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 348 

ID ACF01475 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003040059-A1. 
PD 27-FEB-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 349 

ID ACF41050 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003040078-A1. 
PD 27-FEB-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 350 .... 
ID ACD24390 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261 
PN US2003044918-A1. 
PD 06-MAR-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 351 

ID ACD31491 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261 
PN US2003032132-A1. 
PD 13-FEB-2003. 



Length 2974; 



Length 2974; 



Length 2974; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



Length 2974; 



SEQ ID NO: 521. 



Length 2974; 



SEQ ID NO: 521, 



Length 2974; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



Length 2974; 



Query Match 6.2%; Score 

Best Local Similarity 48.3%; Pred. 
RESULT 352 

ID ACF17992 standard; cDNA; 2974 BP, 

DE Human secreted polypeptide PR06239 

PN US2003054462-A1. 

PD 20-MAR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 

Best Local Similarity 48.3%; Pred. 

RESULT 353 

ID ACF32775 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239 

PN US2003064445-A1. 

PD 03-APR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 

Best Local Similarity 48.3%; Pred. 

RESULT 354 

ID ACF40436 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239 

PN US2003064449-A1. 

PD 03-APR-2003. 

PA (GETH. ) GENENTECH INC. 

Query Match 6.2%; Score 

Best Local Similarity 48.3%; Pred. 

RESULT 355 

ID ACF48396 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239 

PN US2003064441-A1. 

PD 03-APR-2003. 

Query Match 6.2%; Score 

Best Local Similarity ■ 48.3%; Pred. 

RESULT 356 

ID ACF38345 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239 

PN US2003068696-A1. 

PD lO-APR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 

Best Local Similarity 48.3%; Pred. 

RESULT 357 

ID ACF25281 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239 

PN US2003068712~A1. 

PD l.O-APR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 

Best Local Similarity 48.3%; Pred. 

RESULT 358 

ID ACF27181 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239 

PN US2003068730-A1 . 

PD lO-APR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 

Best Local Similarity 48.3%; Pred. 



143.2; DB 9; 
No. 5.7e-31; 



Length 2974; 



-encoding cDNA, SEQ ID NO: 521 



143.2; DB 9; 
No. 5.7e-31; 



Length 2974; 



■encoding cDNA, SEQ ID NO: 521 



143.2; DB 9; 
No. 5.7e-31; 



Length 29-74; 



-encoding cDNA, SEQ ID NO: 521 



143.2; DB 9; 
No. 5.7e-31; 



Length 2974; 



-encoding cDNA, SEQ ID NO: 521 

143.2; DB 9; Length 2974; 
No. 5.7e-31; 

-encoding cDNA, SEQ ID NO: 521 



143.2; DB 9; 
No. 5.7e-31; 



Length 2974; 



-encoding cDNA, SEQ ID NO: 521 



143.2; DB 9; 
No. 5.7e-31; 



Length 2974; 



-encoding cDNA, SEQ ID NO: 521 



143.2; DB 9; 
No. 5.7e-31; 



Length 2974; 



6.2%; 
48.3%; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



DE 
PN 
PD 
PA 



cDNA #261, 



DE 
PN 
PD 
PA 



RESULT 359 

ID. • ACF29637 standard; cDNA; 2974 BP. 
DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003073174-A1. 
PD 17-APR-2003. 

Query Match 

Best Local Similarity 
RESULT 360 

ID ACD87871 standard; cDNA; 2974 BP. 

Human secreted/transmembrane protein (PRO) 
US2003068775-A1. 
lO-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 361 

ID ACF76332 standard; cDNA; 2974 BP. 

Human secreted polypeptide PR06239-encoding cDNA, 
US2003104545-A1. 
05-JUN-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 362 

ID ACF49624 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003104541-A1. 
PD 05-JUN-2003. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 363 

ID ACF4 4 081 standard; cDNA; 2974 BP. 

Human secreted polypeptide PR06239-encoding cDNA, 
US2003104554-A1. 
05-JUN-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 364 

ID ACH06426 standard; cDNA; 2974 BP. 

cDNA encoding human PRO polypeptide #261. 
US2003049762-A1. 
13-MAR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 365 

ID ACH06733 standard; cDNA; 2974 BP. 

cDNA encoding human PRO polypeptide #261. 
US2003049765-A1. 
13-MAR-2003. 
(GETH ), GENENTECH INC. 
Query Match 6,2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 366 

ID ADA83590 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261. 



SEQ ID NO: 521 



Length 2974; 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



Length 2974; 



Length 2974; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



6.2%; 
48.3%; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



6.2%; 
48.3%; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



PN US2003049752-A1. 
PD 13-MAR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 367 

ID ACC92789 standard; cDNA; 2974 BP. 
DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003032133-A1. 
PD 13-FEB-2003. 

Query Match 

Best Local Similarity 
RESULT 368 

ID ACC93403 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003032136-A1 . 
PD 13-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 369 

ID ACF19448 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003036129-A1. 
PD 20-FEB-2003. 

Query Match 

Best Local Similarity 
RESULT 370 

ID ACD13139 standard; cDNA; 2974 BP. 

DE Human secreted/ transmembrane protein (PRO) cDNA #261 
PN US2003040053-A1. 
PD 27-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 371 

ID ACF06597 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003040057-A1. 
PD 27-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 372 

ID ACC94631 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003054467-A1. 
PD 20-MAR-2003. 
PA (GETH ) GENENTECH INC^.^ 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 373 

ID ACC98059 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003044932-A1. 
PD 06-MAR-2003. 

Query Match 

Best Local Similarity 
RESULT 374 

ID ACC94324 standard; cDNA; 2974 BP. 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



6.2%; 
48.3%; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



Score 143.2; DB 9; 
Pred. No. 5.7e~31; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003027270-A1. 
PD 06-FEB-2003. 

Query Match 6,2% 

Best Local Similarity 48.3% 
RESULT 375 

ID ACF42278 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 

PN US2003054469-A1. 

PD 20-MAR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 376 

ID ACD31184 standard; cDNA; 2974 BP. 

DE Human secreted/ transmembrane protein (PRO) cDNA #2 
PN US200303212 6- Al . 
PD 13-FEB-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 377 

ID ACD43213 standard; cDNA; 2974 BP. 

cDNA encoding human PRO polypeptide #261. 
US2003054463-A1. 
20-MAR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 378 

ID ACD43520 standard; cDNA; 2974 BP. 

cDNA encoding human PRO polypeptide #261. 
US2003054466-A1. 
20-MAR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 379 

ID ACF15050 standard; cDNA; 2974 BP. 

Human secreted polypeptide PR06239-encoding cDNA^ 
US2003059879-A1. 
27-MAR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 380 

ID ACF01782 standard; cDNA; 2974 BP... 

DE Human secreted polypeptide PR06239-encoding cDNA, 

PN US2003049738-A1. 

PD 13-MAR-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 381 

ID ACF31854 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 

PN US2003064469-A1. 

PD 03-APR-2003. 

PA (GETH ) GENENTECH INC. 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



SEQ ID NO: 521 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



61, 



Length 2974; 



Length 2974; 



Length 2974; 

SEQ ID NO: 521 

Length 2974; 

SEQ ID NO: 521 
Length 2974; 

SEQ ID NO: 521 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



cDNA #261. 



DB 9; 
7e-31; 



Query Match 6.2%; 
Best Local Similarity 4 8.3%; 
RESULT 382 

ID ACD67531 standard; cDNA; 2974 BP. 

DE cDNA encoding human PRO polypeptide #261. 

PN US2003064453-A1. 

PD 03-APR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 383 

ID ACD48721 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #2 

PN US2003064466-A1. 

PD 03-APR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 384 

ID ACD49028 standard; cDNA; 2974 BP. 
DE Human secreted/transmembrane protein (PRO) 
PN US2003064468-A1. 
PD 03-APR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; 

Best Local Similarity 48.3%; Pred. No 
RESULT 385 

ID ACF51466 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 

PN US2003068760-A1. 

PD lO-APR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 386 

ID ACF54229 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 

PN US2003068769-A1. 

PD lO-APR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2;- DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 387 ■ 

ID ACF25953 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239~encoding cDNA^ 

PN US2003045700-A1. 

PD 06-MAR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 388 

ID ACF39266 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 

PN US2003068698-A1. 

PD lO-APR-2003. 

PA (GETH ) GENENTECH INC. 



Length 2974; 



Length 2974; 



61. 



Length 2974; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



SEQ ID NO: 521 



Query Match 



6.2%; Score 143.2; DB 9; Length 2974; 



Length 2974; 



Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 389 

ID ACF29023 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003068759-A1. 
PD lO-APR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 390 

ID ACD90940 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261. 

PN US2003049748-A1. 

PD 13-MAR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; Length 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 

RESULT 391 

ID ACD86643 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261. 

PN US2003068765-A1. 

PD . lO-APR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5,7e-31; 
RESULT 392 

ID ACH05505 standard; cDNA; 2974 BP. 
DE cDNA encoding human PRO polypeptide #261. 
PN US2003049754-A1. 
PD 13-MAR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143,2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 393 

ID ACF65301 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003068688-A1. 
PD 10-7^R-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 394 

ID ADB20633 ^standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261 
PN US2003082767-A1. 
PD Ol-MAY-2003. 

Query Match 6.2% 
Best Local Similarity 48.3% 
RESULT 395 

ID ACF43774 standard; cDNA; 2974 BP. 
DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003104552-A1. 
PD 05-JUN-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 396 



SEQ ID NO: 521. 



Length 2974; 



2974; 



Length 2974; 



SEQ ID NO: 521, 



Length 2974; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



Length 2974; 



SEQ ID NO: 521, 



Length 2974; 



Length 2974; 



ID ACH09244 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261. 
FN US2003049774-A1. 
PD 13-MAR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No*. 5.7e-31; 
RESULT 397 

ID ACH09551 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261. 

PN US2003049775-A1. 

PD 13-MAR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 398 

ID ADA78885 standard; cDNA; 2974 BP. 

Human secreted/transmembrane protein (PRO) cDNA #261. 
US2003073181-A1. 
17-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 399 

ID ACF09974 standard; cDNA; 2974 BP. 

Human secreted polypeptide PR06239-encoding cDNA, 
US2003068720-A1. 
lO-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; . Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 400 

ID ACF51159 standard; cDNA; 2974 BP. 

Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521, 
US2003068739-A1. 
lO-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 401 

ID ACF24053 standard; cDNA; 2974 BP. 

Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521, 
US2003068763-A1. 
lO-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match ...v.. . 6.2%; Score 143.2; DB 9; Length 2974;^.^ 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 402 

ID ACD88485 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261. 

PN US2003068689-A1. 

PD lO-APR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 403 

ID ACH09858 standard; cDNA; 2974 BP. 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



Length 2974; 



SEQ ID NO: 521. 



Length 2974; 



Score 143.2; DB 9; 
Pred. No, 5.7e-31; 



DE Human secreted/transmembrane protein (PRO) cDNA #261 
PN US2003049776-A1. 
PD 13-MAR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 404 

ID ACH10779 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261. 

PN US2003049780-A1. 

PD 13-MAR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; Length 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 

RESULT 405 

ID ACD11586 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261. 
PN US2003036126-A1. 
PD 20-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 406 

ID ACC96636 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA^ 
PN US2003044924-A1. 
PD 06-MAR-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 407 

ID ACC98666 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003044927-A1. 
PD 06-MAR-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 4 08 

ID ACF41971 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003040072-A1. 
PD 27-FEB-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 409 

ID ACF16892 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003040073-A1 . 
PD 27-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 410 

ID ACD32412 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261. 

PN US2003054475-A1. 

PD 20-MAR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; Length 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 



Length 2974; 



2974; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



Length 2974; 



SEQ ID NO: 521, 



Length 2974; 



SEQ ID NO: 521. 



Length 2974; 



SEQ ID NO: 521. 



Length 2974; 



SEQ ID NO: 521. 



Length 2974; 



2974; 



(PRO) cDNA #261. 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



Length 2974; 



cDNA #261. 



RESULT 411 

ID ACD30570 standard; cDNA; 2974 BP. 
DE Human secreted/transmembrane protein 
PN US2003032124-A1. 
PD 13-FEB-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 412 

ID ACD41441 standard; cDNA; 2974 BP. 
DE Human secreted/transmembrane protein (PRO) 
PN US2003064467-A1. 
PD 03-APR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 413 

ID ACF07825 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521. 

PN US2003049759-A1. 

PD 13~MAR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 414 



Score 143.2; DB 9; Length 2974; 
Pred. No. 5.7e-31; 



Length 2974; 



SEQ ID NO: 521. 



Length 2974; 



ID ACF31240 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003064455-A1. 
PD 03-APR"2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 415 

ID ACF77560 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521, 

PN US2003054465-A1. 

PD 20-MAR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 416 

ID ACF11202 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA^ 
PN US2003073170-A1. 
PD 17-APR-2003. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 417 

ID ACF33082 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521, 

PN US2003073176-A1. 

PD 17-APR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5-7e-31; 
RESULT 418 

ID ACF26260 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521. 



Length 2974; 



SEQ ID NO: 521. 



Length 2974; 



PN US2003068717-A1. 
PD lO-APR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 14 3.2; DB 9; 

Best Local Similarity 48.3%; Fred. No. 5.7e-31; 
RESULT 419 

ID ACD83573 standard; cDNA; 2974 BP. 
DE Human PRO polynucleotide #261. 
PN US2003068728-A1. 
PD lO-APR-2003. 
PA ' (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 420 

ID ACF23746 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003068764-A1. 
PD lO-APR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 421 

ID ACF43160 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003104550-A1 . 
PD 05-JUN-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 422 

ID ACF43467 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003104551-A1. 
PD 05-JUN-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 423 

ID ACH06119 standard; cDNA; 2974 BP. 
DE cDNA encoding human PRO polypeptide #261. 
PN US2003049761-A1. 
PD 13-MAR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 424 .. ... 

ID ACH08937 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261 

PN US2003049757-A1. 

PD 13-MAR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 425 

ID ACC90531 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA^ 
PN US2003027273-A1. 



Length 2974; 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



Length 2974; 



Length 2974; 



SEQ ID NO: 521 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



PD 06-FEB-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 426 

ID ACF10895 standard; cDNA; 2974 BP. 
DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003036119-A1 . 
PD 20-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 427 

ID ACC93710 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003036120-A1. 
PD 20-FEB-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 428 

ID ACC96329 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003036161-A1. 
PD 20-FEB-2003. 
PA (GETH ).GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 429 

ID ACD25004 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261. 
PN US2003044921-A1. 
PD 06-MAR-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 430 

ID ACF02089 standard; cDNA; 2974 BP. 

Human secreted polypeptide PR06239-encoding cDNA, 
US2003049739-A1. 
13-MAR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9;, 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 431 

ID ACF22211 standard; cDNA; 2974 BP. 

Human secreted polypeptide PR06239-encoding cDNA, 
US2003059882-A1. 
27-MAR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 432 

ID ACF22825 standard; cDNA; 2974 BP. 

Human secreted polypeptide PR06239-encoding cDNA, 
US2003059884-A1. 
27-MAR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 433 



Length 2974; 



SEQ ID NO: 521. 



Length 2974; 



SEQ ID NO: 521. 



Length 2974; 



SEQ ID NO: 521. 



Length 2974; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



Length 2974; 



SEQ ID NO: 521. 



Length 2974; 



SEQ ID NO: 521. 



Length 2974; 



SEQ ID NO: 521. 



Length 2974; 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



ID ACF09053 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003068687-A1. 
PD lO-APR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 434 

ID ACF33389 standard; cDNA; 2974 BP. 

Human secreted polypeptide PR06239-encoding cDNA, 
US2003073186-A1. 
17-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 435 

ID ACF54843 standard; cDNA; 2974 BP. 

Human secreted polypeptide PR06239-encoding cDNA, 
US2003064443-A1. 
03-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 436 

ID ACF48703 standard; cDNA; 2974 BP. 

Human secreted polypeptide PR06239-encoding cDNA, 
US2003064444-A1. 
03-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 437 

ID ACD47493 standard; cDNA; 2974 BP. 

Human secreted/ transmembrane protein (PRO) cDNA #261. 
US2003068697-A1. 
lO-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; Length 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 438 

ID ACD49335 standard; cDNA; 2974 BP. 

Human secreted/ transmembrane protein (PRO) cDNA #261. 
US2003068710-A1. 
lO-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match . 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No, 5.7e-31; 
RESULT 439 

ID ACF38038 standard; cDNA; 2974 BP. 

Human secreted polypeptide PR06239-encoding cDNA, 
US2003068686-A1. 
lO-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match ' 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 440 

ID ACF30251 standard; cDNA; 2974 BP. 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



SEQ ID NO: 521. 



Length 2974; 



SEQ ID NO: 521, 



Length 2974; 



SEQ ID NO: 521. 



Length 2974; 



SEQ ID NO: 521. 



Length 2974; 



2974; 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



Length 2974;., 



SEQ ID NO: 521. 



Length 2974; 



DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003073178-A1. 
PD 17-APR-2003. 
PA {GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 441 

ID ACD87564 standard; cDNA; 2974 BP. 

DE Human secreted/ transmembrane protein (PRO) cDNA #261. 
PN US2003068774-A1. 
PD lO-APR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 442 

ID ACF62153 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003104538-A1. 
PD 05-JUN-2003. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity .48.3%; Pred. No. 5.7e-31; 
RESULT 443 

ID ACH11086 standard; cDNA; 2974 BP. 

DE Human secreted/ transmembrane protein (PRO) cDNA #261. 

PN US2003049781-A1. 

PD 13-MAR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; Length 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 

RESULT 444 

ID ACD10251 standard; cDNA; 2974 BP. 

DE Human secreted/ transmembrane protein (PRO) cDNA #261. 
PN US2003036158-A1. 
PD 20-FEB-2003. 

Query Match 6.2%; 

Best Local Similarity 48.3%; 
RESULT 445 

ID ACD16976 standard; cDNA; 2974 BP. 
DE cDNA encoding human PRO polypeptide #261. 
PN US2003036151-A1 . 
PD 20-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 44 6 

ID ACC99273 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003040067-A1. 
PD 27-FEB-2003. 

Query Match - 6.2%; 

Best Local Similarity 48.3%; 
RESULT 447 

ID ACF00667 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA^ 
PN US2003054456-A1. 
PD 20-MAR-2003. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 



SEQ ID NO: 521 



Length 2974; 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



2974; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



Length 2974; 



Score 143.2; DB 9; 
Pred. No. 5.7e-31; 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



Length 2974; 



RESULT 448 

ID ACD41134 standard; cDNA; 2974 BP. 

DE Human secreted/ transmembrane protein (PRO) cDNA #261. 
FN US2003054482-A1. 
PD 20-MAR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 449 

ID ACF14743 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003054457-A1. 
PD 20-MAR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 450 

ID ACF22518 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521 

PN US2003059883-A1. 

PD 27-MAR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 451 



SEQ ID NO: 521 



Length 2974; 



Length 2974; 



ID ACF79095 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003049764-A1. 
PD 13-MAR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 452 

ID ACF11816 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003073177-A1. 
PD 17-APR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 453 

ID ACF51773 standard; cDNA; 2974 BP. 

DE Hiiman secreted polypeptide PR06239-encoding cDNA, 
PN US2003064442-A1. 
PD 03-APR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 454 

ID ACF33696 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521 

PN US2003064450-A1. 

PD 03-APR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 455 



SEQ ID NO: 521 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



Length 2974; 



.2; DB 9; 
5.7e-31; 



2; DB 9; 
5.7e-31; 



Length 2974; 



SEQ ID NO: 521 



Length 2974; 



Length 2974; 



ID ACD49949 standard; cDNA; 2974 BP. 

DE Human secreted/ transmembrane protein (PRO) cDNA #261 
PN US2003068731-A1 . 
PD lO-APR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143 

Best Local Similarity 48.3%; Pred. No. 
RESULT 456 

ID ACF37731 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA^ 
PN US2003068683-A1. 
PD lO-APR-2003. 

Query Match 6.2%; Score 143< 

Best Local Similarity 48.3%; Pred. No. 
RESULT 457 

ID ACF28716 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521 

PN US2003068754-A1. 

PD lO-APR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 458 

ID ACD88792 standard; cDNA; 2974 BP. 

DE Human secreted/ transmembrane protein (PRO) cDNA #261, 
PN US2003068681-A1. 
PD lO-APR-2003. 

Query Match 6.2%; Score 

Best Local Similarity 48.3%; Pred. 
RESULT 459 

ID ACF75411 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521 

PN US2003096351-A1. 

PD 22-MAY-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 460 

ID ACF61232 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521 

PN US2003096358-A1. 

PD 22-MAY-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 461 



143.2; DB 9; 
No. 5.7e-31; 



Length 2974; 



Length 2974; 



ID ACF44388 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003104556-A1. 
PD 05-JUN-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 462 

ID ACH08630 standard; cDNA; 2974 BP. 

DE Human secreted/ transmembrane protein (PRO) cDNA #261. 
PN US2003049756-A1. 



SEQ ID NO: 521 



Length 2974; 



PD 13-MAR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 463 

ID ACC94017 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521 
PN US2003036122-A1. 
PD 20-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 9; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 464 

ID ACD21122 standard; cDNA; 2974 BP. 

DE Human secreted/ transmembrane protein (PRO) cDNA #261. 

PN US2003036121-A1 . 

PD 20-FEB-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 4 65 

ID ACF06904 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521 
PN US2003040065-A1. 
PD 27-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 9; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 466 

ID ACD20815 standard; cDNA; 2974 BP. 

DE Human secreted/ transmembrane protein (PRO) cDNA #261. 
PN US2003044919-A1. 
PD 06-MAR-2003. 

Query Match 6.2%; Score 143.2; DB 9; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 467 

ID ACD22964 standard; cDNA; 2974 BP. 

DE Human secreted/ transmembrane protein (PRO) cDNA #261. 
PN US2003040077-A1. 
PD 27-FEB-2003. 

Query Match 6.2%; Score 143.2; DB 9; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 4 68 

ID ACF41664 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521 
PN US200304 4928-A1. 
PD 06-M7VR-2003. 

Query Match 6.2%; Score 143.2; DB 9; -^.Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 469 

ID ACF07211 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, SEQ ID NO: 521 

PN US2003049746-A1. 

PD 13-MAR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; Length 2974; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 470 

ID ACF77867 standard; cDNA; 2974 BP. 



DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003054464-A1. 
PD 20-MAR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 471 

ID ACD46265 standard; cDNA; 2974 BP. 

DE Human secreted/ transmembrane protein (PRO) cDNA #261. 

PN US2003064459-A1. 

PD 03-APR-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5-7e-31; 
RESULT 472 

ID ACF47168 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003068757-A1. 
PD lO-APR-2003. 
PA (GETH )• GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 473 

ID ACF54536 standard; cDNA; 2974 BP. 

Human secreted polypeptide PR06239-encoding cDNA, 
US2003068723-A1. 
lO-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 474 

ID ACF45940 standard; cDNA; 2974 BP. 

Human secreted polypeptide PR06239-encoding cDNA, 
US2003068741-A1. 
lO-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 475 

ID ACF45633 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003068744-A1. 
PD lO-APR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Locals Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 476 

ID ACF38652 standard; cDNA; 2974 BP. 

DE Human secreted polypeptide PR06239-encoding cDNA, 
PN US2003068766-A1. 
PD lO-APR-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 6.2%; Score 143.2; DB 9; 

Best Local Similarity 48.3%; Pred. No. 5.7e-31; 
RESULT 477 

ID ACD89713 standard; cDNA; 2974 BP. 

DE Human secreted/transmembrane protein (PRO) cDNA #261 
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